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Chapter  1.  Overview 


Indttd,  you  can  build  a  machlnt  to  draw  dtmonstrcUvt  conclusions  for  you,  but  / 
tfdnk  you  can  never  build  a  machine  that  will  draw  plausible  inferences. 

—  Polya 


1.1.  Abstract  of  this  Thesis 


A  program,  called  "AM",  is  described  which  models  one  aspect  of  elementary  mathematics 
research:  developing  new  concepts  under  the  guidance  of  a  large  body  of  heuristic  rules. 
"Mathematics*  is  considered  ar  a  type  of  intslligent  behavior,  not  as  a  finished  product. 


The  local  heuristics  communicate  via  an  agenda  mechanism,  a  global  list  of  tasks  for  the 
system  to  perform  and  reasons  why  each  task  is  plausible.  A  single  task  might  direct  AM  to 
define  a  new  concept,  or  to  explore  some  facet  of  an  existing  concept,  or  to  examine  some 
empirical  data  for  regularities,  etc.  Repeatedly,  the  program  selects  from  the  agenda  the 
task  having  the  best  supporting  reasons,  and  then  executes  it. 


Each  concept  is  an  active,  structured  knowledge  module.  A  hundred  very  incomplete 
modules  are  initially  provided,  each  one  corresponding  to  an  elementary  set-theoretic 
concept  (e.g.,  union).  This  provides  a  definite  but  immense  "space"  which  AM  begins  to 
explore.  AM  extends  its  knowledge  base,  ultimately  rediscovering  hundreds  of  common 
concepts  (e.g.,  numbers)  and  theorems  (e.g.,  unique  factorization). 


This  approach  to  plausible  inference  contains  great  powers  and  great  limitations. 
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Scientists  oftsn  face  the  difficult  task  of  formulating  nontrivial  research  problems  which  are 
solvable.  In  any  given  branch  of  science,  it  is  usually  easier  to  tackle  a  specific  given 
problem  than  to  propose  Interesting  yet  managable  new  questions  to  investigate.  For 
example,  contrast  solving  the  Missionaries  and  Cannibals  problem  with  the  more  ill-defined 
reasoning  which  led  to  invtnting  it. 


This  thesis  is  concerned  with  creative  theory  formation  in  maiiiematics:  how  to  propose' 
interesting  new  concepts  and  plausible  hypotheses  connecting  them.  The  experimental 
vehicle  of  my  research  is  a  computer  program  called  AM'  Initially,  AM  is  given  the 
definitions  of  115  simple  set-theoretic  concepts  (like  "Delete",  "Equality").  Each  concept  is 
represented  internally  as  a  data  structure  with  a  couple  dozen  slots  or  facets  (like 
"Definition",  "Examples',  "Worth").  Initially,  most  facets  of  most  concepts  are  blank,  and 
AM  uses  a  collection  of  ffSO  heuristics  -  plausible  rules  of  thumb  -  for  guidance,  as  it  tries 
to  fill  in  those  blanks.  Some  heuristics  are  used  to  select  which  specific  facet  of  which  specific 
concept  to  explore  next,  while  others  are  used  to  actually  find  some  appropriate  information 
about  the  chosen  facet.  Other  rules  prompt  AM  to  notice  simple  relationships  between 
known  concepts,  to  define  promising  new  concepts  to  investigate,  and  to  estimate  how 
interesting  each  concept  is. 


1.2.1.  Detour:  Analysis  of  a  discoversi 


Before  discussing  how  to  synthesize  a  new  theory,  consider  briefly  how  to  analyze  one,  how 
to  construct  a  plausible  chain  of  reasoning  which  terminates  in  a  given  discovery.  One  can 
do  this  by  working  backwards,  by  reducing  the  creative  act  to  simpler  and  simpler  creative 
acts.  For  example,  consider  the  concept  of  prime  numbers.  How  might  one  be  led  to  define 
such  a  notion?  Notice  the  following  plausible  strategy: 


"If  f  it  a  function  which  trtnsformt  olonrantt  of  A  into  olomontt  of  B,  and 
B  it  ordorad,  than  consider  /utt  thoia  mambart  of  A  which  ara 
trantformad  into  extremal  alamantt  of  B.  Thit  cot  it  an  intarotting  cubtat 
of  A." 


When  f(x)  means  "divisors  of  x",  and  the  ordering  Is  "by  length",  this  heuristic  says  to 
consider  those  numbers  which  have  a  minimal^  number  of  factors  -  that  is,  the  primes.  So 
this  rule  actually  reduces  our  task  from  "proposing  the  concept  of  prime  numbers"  to  the 
more  elementary  problems  of  "discovering  orderlng-by-length"  and  "Inventing  divlsors-of*. 


But  suppose  we  know  this  general  rule.  "If  f  it  an  Intarasting  function,  contidar  itt  invarta.”  It 


'  Th«  origin«l  MAtnint  of  this  mmmonic  hss  boon  ibindonod.  n*  Exodu*  ttatos:  i  AM  that  I  ^ 

2 

Th*  othor  oxtromo,  numbor*  with  •  MAXIMAL  numbor  of  factor*,  wa*  aho  propotod  by  AM  a*  worth  mvoatigatinc.  Thi*  lod 
AM  to  many  intart  .'tinf  quaation*.  $**  Appendix  4. 
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reduces  the  task  of  discovering  divisors-of  to  the  simpler  task  of  discovering  multiplication®. 
Eventually,  this  task  reduces  to  the  discovery  of  very  basic  notions,  like  substitution,  set- 
union,  and  equality.  To  explain  how  a  given  researcher  might  have  made  a  given 
discovery,  such  an  analysis  is  continued  until  that  inductive  task,  is  reduced  to  "discovering" 
notions  which  the  researcher  already  knew,  which  were  his  conceptual  primitives. 


1.2.2.  What  AM  does:  Syntheses  of  discoveries 


T his  leads  to  the  paradox  that  the  more  original  a  discovery  the  more  obvious  it 
seems  afterwards.  The  creative  act  is  not  an  act  of  creation  in  the  sense  of  the 
Old  T estament.  It  does  not  create  something  out  of  nothing;  it  uncovers,  selects, 
reshuffles,  combines,  synthesizes  already  existing  facts,  faculties,  skills.  The  more 
familiar  the  parts,  the  more  striking  the  new  whole. 

-  KoettUr 


Suppose  a  large  collection  of  these  heuristic  strategies  has  been  assembled  (e.g.,  by  analyzing 
a  great  many  discoveries,  and  writing  down  new  heuristic  rules  whenever  necessary). 
Instead  of  using  them  to  explain  how  a  given  idea  might  have  evolved,  one  can  imagine 
starting  from  a  basic  core  of  knowledge  and  "running"  the  heuristics  to  generate  new 
concepts.  We’re  talking  about  reversing  the  process  described  in  the  last  section;  not  how  to 
explain  discoveries,  but  how  to  make  them. 

Such  syntheses  are  precisely  what  AM  does.  The  program  consists  of  a  large  corpus  of 
primitive  mathematical  concepts,  each  with  a  few  associated  heuristics’’.  AM’s  activities  all 
serve  to  expand  AM  itself,  to  enlarge  upon  a  given  body  of  mathematical  knowledge.  To 
cope  with  the  enormity  of  the  potential  "search  space"  Involved,  AM  uses  its  heuristics  as 
judgmental  criteria  to  guide  development  in  the  most  promising  direction.  It  appears  that 
the  process  of  Inventing  worthwhile  new®  concepts  can  be  guided  successfully  using  a 
collection  of  a  few  hundred  such  heuristics. 

Each  concept  is  represented  as  a  frame-like  data  structure  with  25  different  facets  or  slots. 
The  types  of  facets  include:  Examples,  Definitions,  Generalizations,  Dc  ,iain/Rsnce,  Analogies, 
Interestingneas,  and  many  others.  Modular  representation  of  concepts  provides  a  convenient 
scheme  for  organizing  the  heuristics;  for  example,  the  following  strategy  fits  into  the 
Examples  facet  of  the  Predicate  concept:  "If,  empirically,  10  times  as  many  elements /af/  some 
predicate  P,  as  satisfy  it,  then  some  generalization  (weakened  version)  of  P  might  be  more 
interesting  than  P".  AM  considers  this  suggestion  after  trying  to  fill  in  examples  of  each 
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PkM  noticinc  (hat  ntuHtplicition  it  ittocialivt  and  commutativa 
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Srtut(ion/aet»n  ru(a»  which  function  at  tocal  "plautibl*  movf  ftnaratort".  Sew*  au-jest  taaka  for  the  ayatam  to  carry 
out,  aoew  autcaat  waya  of  aatitfyint  a  |ivan  (atk,  ate 

Typically,  *naw*  meant  new  to  AM,  not  to  Mtnkindi  and  'worthwhile*  can  only  be  judfad  in  hindaifht. 
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prcdicate*. 

AM  is  initially  given  a  collection  of  115  cote  concepts,  with  only  a  few  facets  filled  in  for 
each.  Its  sole  activity  is  to  choose  some  facet  of  some  concept,  and  fill  in  that  particular  slot. 
In  so  doing,  new  notions  will  often  emerge.  Uninteresting  ones  are  forgotten,  mildly 
interesting  ones  are  kept  as  parts  of  one  facet  of  one  concept,  and  very  interesting  ones  are 
granted  full  concept-module  status.  Each  of  these  new  modules  has  dozens  of  blank  slots, 
hence  the  space  of  possible  actions  (blank  facets  to  hi!  in)  grows  rapidly.  The  same 
heuristics  are  used  both  to  suggest  new  directions  for  investigation,  and  to  limit  attention: 
both  to  sprout  and  to  prune. 


I.2.S.  Results 


The  particular  mathematical  domains  in  wh»ch  AM  operates  depend  upon  the  choice  of 
initial  concepts.  Currently,  AM  begins  with  nothing  but  a  scanty  knowledge  of  concepts 
which  Piaget  might  describe  as  prenumtrical:  Sets,  substitution,  operations,  equality,  and  so 
on.  In  particular,  AM  is  not  told  anything  about  proof,  single-valued  functions,  or 
numbers. 

From  this  primitive  basis,  AM  quickly  discovered^  elementary  numerical  concepts 
(corresponding  to  those  we  refer  to  as  natural  numbers,  multiplication,  factors,  and  primes) 
and  wandered  around  in  the  domain  of  elementary  number  theory.  AM  was  not  designed 
to  prove  anything,  but  it  did  conjecture  many  well-known  relationships  (e.g.,  the  unique 
factorization  theorem). 

AM  was  not  able  to  discover  any  "new-to-Mankind"  mathematics  purely  on  its  own,  but  has 
discovered  several  interesting  notions  hitherto  unknown  to  the  author.  A  couple  bits  of  new 
mathematics  have  been  inspired  by  AM.^  A  synergetic  AM-human  combination  can 
sometimes  produce  better  research  than  either  could  alone.*  Although  most  of  the  concepts 
AM  proposed  and  developed  were  already  very  well  known,  AM  defined  some  of  them  in 
novel  ways  (e.g.,  prime  pairs  were  defined  by  restricting  addition  to  primes;  that  is,  for 
which  primes  p,q,r  is  it  possible  that  p*q-r?®). 

Everything  that  AM  does  can  be  viewed  as  testing  the  underlying  body  of  heuristic  rules. 
Gradually,  this  knowledge  becomes  better  organized,  its  implications  clearer.  The  resultant 
body  of  detailed  heuristics  may  be  the  germ  of  a  more  efficient  programme  for  educating 


In  fKl,  tftcr  AM  ttttmpt*  In  find  tximplt*  gf  SET-EQUAHTY,  «o  ftw  »r»  fovnd  thd  AM  dgeidft  to  |•ntr«|j^•  that 
pradicata.  Tha  raiult  ia  tha  craation  gf  a  naw  pradkata  which  maana  ’Haa-tha-aatna-lanith-aa''  —  i.a.,  a 
rudimantary  praouragr  tg  natural  numbara. 

^  'Diacgvarint*  a  egneapt  maana  that  (I)  AM  raegfnizad  it  aa  a  diatinfuiahad  antity  (a|,  by  fnrmulatinf  Ha  dafinition)  and 
alag  (2)  AM  daeidad  H  waa  wgrth  invaati{atin(  (aithar  bacavaa  gf  tha  inlaraatinc  way  H  waa  fgrtnad,  or 
bacauaa  gf  aurpriainf  praliminary  ampirical  raauHa) 

*  Thia  ia  aupportad  by  Galarntar'a  axpariancaa  with  hia  (aomatry  program-  Whila  lacturing  about  how  H  might  prova  a 
cartain  thaoram  about  iaoacaiaa  trianglaa,  ha  eama  up  with  a  naw,  cuta  proof.  SimilarVi  Guard  and  Eaatman 
notkad  an  intarmadiata  raault  of  thair  SAM  raaoMion  thaoram  provar,  and  wiaaly  intarpratad  it  aa  a 
nontrivial  raauH  in  laltka  thaory  (now  known  aa  SAM’a  lamma). 

Tha  anawar  ia  that  aHhar  p  or  q  muat  bo  2,  and  that  tha  othar  two  primaa  ara  a  prima  pair  —  ia.,  thay  diffar  by  two. 
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math  students  than  the  current  dogma*®. 

Another  benefit  of  actually  constructing  AM  is  that  of  experimentation:  one  can  vary  the 
concepts  AM  starts  with,  vary  the  heuristics  available,  etc.,  and  study  the  effects  on  AM’s 
behavior.  Several  such  experiments  were  performed.  One  involved  adding  a  couple  dozen 
new  concepts  from  an  entirely  new  domain:  plane  geometry.  AM  busied  Itself  exploring 
elementary  geometric  concepts,  and  was  almost  as  productive  there  as  in  its  original  domain. 
New  concepts  were  defined,  and  new  conjectures  formulated.  Other  experiments  indicated 
that  AM  was  more  robust  than  anticipated;  it  withstood  many  kinds  of  "de-tuning".  Others 
demonstrated  the  tremendous  impact  that  a  few  key  concepts  (e.g..  Equality)  had  on  AM’s 
behavior.  Several  more  experiments  and  extensions  have  been  planned  for  the  future. 


1.2.4.  Motivation  foptionall 


We  need  a  super-mat  hematics  in  which  the  operations  are  as  unknown  as  the 
quantities  they  operate  on,  and  a  super-mathematician,  who  does  not  know  what 
he  is  doing  when  he  performs  these  operations. 

—  Eddington 


Although  the  motivation  for  carrying  out  this  research  of  course  preceded  the  effort,  I  have 
delayed  until  this  section  a  discussion  of  why  this  is  worthwhile,  why  it  was  attempted. 

First  there  was  the  inherent  interest  of  getting  a  handle  on  scientific  creativity.  AM  Is  partly 
a  demonstration  that  some  aspects  of  creative  theory  formation  can  be  demystified,  can  be 
modelled  as  simple  rule-governed  behavior. 

Related  to  this  is  the  potential  for  learning  from  AM  more  about  the  processes  of  concept 
formation.  This  was  touched  on  previously,  and  several  experiments  already  performed  on 
AM  will  be  detailed  later. 

Third,  AM  Itself  may  grow  into  something  of  pragmatic  value.  Perhaps  it  will  become  a 
useful  tool  for  mathematicians,  for  educators,  or  as  a  model  for  similar  systems  in  more 
"practical"  fields.  Perhaps  in  the  future  we  scientists  will  be  able  to  rely  on  automated 
assistants  to  carry  out  the  "hack"  phases  of  research,  the  tiresome  legwork  necessary  for 
"secondary"  creativity. 

Historically,  the  domain  of  AM  came  from  a  search  for  a  scientific  field  whose  activities  had 
no  specific  goal,  and  in  which  natural  language  abilities  were  unnecessary.  This  was  to  test 
out  the  BEINGS  [Lenat  75b]  ideas  for  a  modular  representation  of  knowledge. 


Curr«ntly,  «n  aAKitor  vtry  b««t  «rork  iny  mitharnttieiin  hat  avar  dona,  polithat  K  until  Ha  brilhanca  ia 

btindin;,  than  pratantt  H  to  tha  atudant  to  induct  upon.  Many  individuala  (af.,  Knuth  and  Polya)  hava 
pointad  out  thit  blundar.  A  faw  (af.,  Papart  at  MU,  Adamt  at  Stanford)  tra  axparimantinf  with  mora 
raaKatic  atratafiat  for  ’taachmf”  craativHy.  Saa  tha  rafaratKaa  by  thaaa  authora  in  tha  bbliofraphy. 
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It  would  be  unfair  not  to  mention  the  usual  bad  reasons  for  this  research;  the  "Look  ma,  no 
hands"  syndrome,  the  AI  researcher’s  classic  maternal  urges,  ego,  the  usual  thesis  drives,  etc. 


I.2.S.  Conclusions 

AM  is  forced  to  judge  o  priori  the  value  of  each  new  concept,  to  lose  interest  quickly  in 
concepts  which  aren’t  going  to  develop  Into  anything.  Often,  such  judgments  can  only  be 
based  on  hindsight.  For  similar  reasons,  AM  has  difficulty  formulating  new  heuristics 
which  are  relevant  to  the  new  concepts  it  creates.  Heuristics  are  often  merely  compiled 
hindsight.  White  AM’s  "approach"  to  empirical  research  may  be  used  In  other  scientific 
domains,  the  main  limitation  (reliance  on  hindsight)  will  probably  recur.  This  prevents 
AM  from  progressing  Indefinitely  far  on  its  own. 

This  ultimate  limitation  was  reached.  AM’s  performace  degraded  more  and  more  as  it 
progressed  further  away  from  its  initial  base  of  concepts.  Nevertheless,  AM  demonstrated 
that  selected  aspects  of  creative  discovery  in  elementary  mathematics  could  be  adequately 
represented  as  a  heuristic  search  process.  Actually  constructing  a  computer  model  of  this 
activity  has  provided  an  experimental  vehicle  for  studying  the  dynamics  of  plausible 
empirical  inference. 


1.3.  Wavs  of  viewing  AM  as  some  common  process 

This  section  will  provide  a  few  metaphors:  some  hints  for  squeezing  AM  into  paradigms 
with  which  the  reader  might  be  familiar,  "'sr  example,  the  existence  of  heuristics  in  AM  is 
functionally  the  same  as  the  presence  of  domain-specific  information  in  any  knowledge- 
based  system. 

Consider  assumptions,  axioms,  definitions,  and  theorems  to  be  syntactic  rules  for  the 
language  that  we  call  Mathematics.  Thus  theorem-proving,  and  the  whole  of  textbook 
mathematics,  is  a  purely  syntactic  process.  Then  the  heuristic  rules  used  by  a 
mathematician  (and  by  AM)  would  correspond  to  the  semantic  knowledge  associated  with 
these  more  formal  methods. 

Just  as  one  can  upgrade  natural-language-understanding  by  incorporating  semantic 
knowledge,  so  AM  is  only  as  successful  as  the  heuristics  it  knows. 

Four  more  ways  of  "viewing"  AM  as  something  else  will  be  provided:  (i)  AM  as  a  hill- 
climber,  (ii)  AM  as  a  heuristic  search  program,  (iii)  AM  as  a  mathematician,  and  (Iv)  AM 
as  a  thesis. 


I.S.I.  AM  as  Hill-climbing 

Let’s  draw  an  analogy  between  the  process  of  developing  new  mathematics  and  the  familiar 
process  of  hill-climbing.  We  may  visualize  AM  as  exploring  a  space  using  a  measuring  or 
evaluation"  function  which  imparts  to  it  a  topography. 
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Consldcr  AM’s  core  of  very  simple  knowledge.  By  compounding  its  known  concepts  and 
methods,  AM  can  explore  beyond  the  frontier  of  this  foundation  a  little  wherever  it  wishes. 
The  incredible  variety  of  alternatives  to  investigate  includes  all  known  mathematics,  much 
trivia,  countless  deadends,  and  so  on.  The  only  "successful"  paths  near  the  core  are  the 
narrow  ridges  of  known  mathematics  (plus  perhaps  a  few  as-yet-undiscovered  isolated 
peaks). 

How  can  AM  walk  through  this  immense  space,  with  any  hope  of  following  the  few,  slender 
trails  of  already-established  mathematics  (or  some  equally  successful  new  fields)?  AM  must 
do  hill-climbing:  As  new  concepts  are  formed,  decide  how  promising  they  are,  and  always 
explore  the  currently  most-promising  new  concept.  The  evaluation  function  is  quite 
nontrivial,  and  this  thesis  may  be  viewed  as  an  attempt  to  study  and  explain  and  duplicate 
the  Judgmental  criteria  people  employ.  Preliminary  attempts"  at  codifying  such 
"mysterious"  emotive  forces  as  Intuition,  aesthetics,  utility,  richness,  interestingness, 
relevance...  indicated  that  a  large  but  not  unmanageable  collection  of  heuristic  rules  should 
suffice. 

The  important  visualization  to  make  is  that  with  proper  evaluation  criteria,  AM’s  planar 
mass  of  interrelated  concepts  is  transformed  into  a  three-dimensional  relief  map:  the  known 
lines  of  development  become  mountain  ranges,  soaring  above  the  vast  flat  plains  of  trivia 
and  inconsistency  below. 

Occasionally  an  isolated  hill  is  discovered  near  the  core;'^  certainly  whole  ranges  lie 
undiscovered  for  long  periods  of  time’^  and  the  terrrain  far  from  the  initial  core  is  not  yet 
explored  at  all. 


Z.  AM  as  Heuristic  Search 


As  the  title  of  this  section  -  and  this  thesis  -  proclaims,  AM  is  a  kind  of  "heuristic  search" 
program.  That  must  mean  that  AM  is  exploring  a  particular  "space,"  using  some  informal 
evaluation  criteria  to  guide  it. 


The  flavor  of  search  which  is  used  here  is  that  of  progressively  enlarging  a  tree.  Certain 
"evaluation-function"  heuristics  are  used  to  decide  which  node  of  the  tree  to  expand  next, 
and  other  guiding  rules  are  then  used  to  produce  from  that  node  a  few  Interesting  successor 
nodes.  To  do  mathematical  research  well,  I  claim  that  it  is  necessary  and  sufficent  to  have 
good  methods  for  proposing  new  concepts  from  existing  ones,  and  for  deciding  how 
interesting  each  "node"  (partially-studied  concept)  is. 

AM  is  initially  supplied  with  a  few  facts  about  some  simple  math  concepts.  AM  then 


Th««a  took  tho  form  of  informal  timuittiont.  Althoufh  far  from  eontroNod  oxporimonta,  thoy  indicatad  tha  faaaability  of 
attamptinf  to  craata  AM,  by  yialdinf  an  approximata  fifura  for  tha  amount  of  informal  knowladfa  aueh  a 
ayatam  arould  naad 
19 

E.|.,  Comxay’a  numbara,  at  daaerfead  in  [knuth  74}. 
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Ef.,  non-Euclidaan  gaomatriaa  araran't  theufht  of  until  IMS. 


s  -v-v. '  v-*\  i.’.  I  V  '.■a’.-g'j  x-'j  vv'-'yi  \r. 


Chipttr  I  AM:  Ditcoviry  in  UtthtiMlic*  it  Hiuriitic  Siirch  -8- 

explores  mathematics  by  selectively  enlarging  that  basis.  One  could  say  that  AM  consists  of 
an  active  body  of  mathematical  concepts,  plus  enough  "wisdom"  to  use  and  develop  them 
effectively.  For  "wisdom",  read  "heuristics".  Loosely  speaking,  then,  AM  is  a  heuristic  search 
program.  To  see  this  more  clearly,  we  must  explain  what  the  nodes  of  AM’s  search  space 
are,  what  the  successor  operators  or  links  are,  and  what  the  evaluation  function  is. 

AM’s  space  can  be  considered  to  consist  of  all  nodes  which  are  consistent,  partially-hlled-in 
concepts.  Then  a  primitive  "legal  move"  for  AM  would  be  to  (i)  enlarge  some  facet  of  some 
concept,  or  (ii)  create  a  new,  partially-complete  concept.  Consider  momentarily  the  size  of 
this  space.  If  there  were  no  constraint  on  what  the  new  concepts  can  be,  and  no  informal 
knowledge  for  quickly  finding  entries  for  a  desired  facet,  a  blind  "legal-move"  program 
would  go  nowhere  -  slowly!  One  shouldn’t  even  call  the  activity  such  a  progfram  would  be 
doing  "math  research." 

The  heuristic  rules  are  used  as  little  "plausible  move  generators".  They  suggest  Wtdch  facet 
of  which  concept  to  enlarge  next,  and  they  suggest  specific  new  concepts  to  create.  The  only 
activities  which  AM  will  consider  doing  are  those  which  have  been  motivated  for  some 
specific  good'^  reason.  A  global  agenda  of  tasks  is  maintained,  listing  alt  the  activities 
suggested  but  not  yet  worked  on. 

AM  has  a  definite  algorithm  for  rating  the  nodes  of  its  space.  Many  heuristics  exist  merely 
to  estimate  the  worth  of  any  given  concept.  Other  heuristics  use  these  worth  ratings  to 
order  the  tasks  on  the  global  agenda  list.  Yet  AM  has  no  specific  goal  criteria:  it  can  never 
"halt",  never  succeed  or  fall  in  any  absolute  sense.  AM  goes  on  forever'®. 

Consider  Nilsson’s  descriptions  of  depth-first  searching  and  breadth-first  searching  ([Nilsson 
71]).  He  has  us  maintain  a  list  of  "open"  nodes.  Repeatedly,  he  plucks  the  top  one  and 
expands  it.  In  the  process,  some  new  nodes  may  be  added  to  the  Open  list.  In  the  case  of 
depth-first  searching,  they  are  added  at  the  top;  the  next  node  to  expand  is  the  one  most 
recently  created;  the  Open-list  is  being  used  as  a  push-down  stack.  For  breadth-first  search, 
new  nodes  are  added  at  the  bottom;  they  aren’t  expanded  until  all  the  older  nodes  have 
been;  the  Open-list  is  used  as  a  queue.  For  heuristic  search,  or  "best-first"  search,  new  nodes 
are  evaluat^  In  some  numeric  way,  and  then  "merged"  Into  the  already-sorted  list  of  Open 
nodes. 

This  process  is  very  similar  to  the  agenda  mechanism  AM  uses  to  manage  its  search.  This 
will  be  discussed  in  detail  in  Chapter  3.  Each  entry  on  the  agenda  consists  of  three  parts: 
(i)  a  plausible  task  for  AM  to  do,  (ii)  a  list  of  reasons  supporting  that  task,  and  (iii)  a 
numeric  estimate  of  the  overall  priority  this  task  should  have.  When  a  task  is  suggested  for 
some  reason,  it  is  added  to  the  agenda.  A  task  may  be  suggested  several  times,  for  different 
reasons.  The  global  priority  value  assigned  to  each  task  is  based  on  the  combined  value  of 
Its  reasons.  The  control  structure  of  AM  is  simply  to  select  the  task  with  the  highest 
priority,  execute  it,  and  select  a  new  one.  The  agenda  mechanism  appears  to  be  a  very  well- 
suited  data  structure  for  managing  a  "best-first"  search  process. 


14 


15 


Of  court*,  AM  think*  t  rttton  i*  ”(ood*  if  --  incf  only  if  ~  it  w*»  told  thit  by  •  h*uri*tic  rul*i  *0  tho«*  rul**  hid  b*tt*r 
b*  pl*uti>l*,  ortt-iribly  th*  on**  ictuilly  u**«)  by  th*  *xp*rt*. 

T*chnie*lly,  for*v*r  i*  ibout  100,000  li*t  c*ll*  ind  t  eoupl*  cpv  hour*. 


Ch*pt*r  I 


AM:  Ditcovtry  in  MntlMinitiet  t*  Hturwtie  S««reh 


-9- 


Similar  control  structures  were  used  in  LT  [Newell,  Shaw,  &  Simon  57],  the  predictor  part 
of  Dendral  [Buchanan  et  ai  69],  SIMULA-67  [Dahl  68],  and  KRL  [Bobrow  &  Winograd 
77],  The  main  difference  is  that  in  AM,  symbolic  reasons  are  used  (albeit  in  trivial  token- 
like  ways)  to  decide  whether  -  and  how  much  -  to  boost  the  priority  of  a  task  when  it  is 
suggested  again. 

There  are  several  difficulties  and  anomalies  in  forcing  AM  into  the  heuristic  search 
paradigm.  In  a  typical  heuristic  search  (e.g.,  Dendral  [Feigenbaum  et  al  71],  Meta-Dendral 
[Buchanan  et  al  72],  most  game-playing  programs  [Samuel  67]),  a  "search  space"  is  defined 
implicitly  by  a  "legal  move  generator".  Heuristics  are  present  to  constrain  that  generator  so 
that  only  plausible  nodes  are  produced.  The  second  kind  of  heuristic  search,  of  which  AM 
is  an  example,  contains  no  "legal  move  generator".  Instead,  AM’s  heuristics  are  used  as 
plau.’.ible  move  generators.  Those  heuristics  themselves  implicitly  define  the  possible  tasks 
AM  might  consider,  and  all  such  tasks  should  be  plausible  one.  In  the  first  kind  of  search, 
removing  a  heuristic  widens  the  search  space;  in  AM’s  kind  of  search,  removing  a  heuristic 
reducts  it. 

Another  anomaly  is  that  the  operators  which  AM  uses  to  enlarge  and  explore  the  space  of 
concepts  are  themselves  mathematical  concepts  (e.g.,  some  heuristic  rules  result  in  the 
creation  of  new  heuristic  rules;  "Compose"  is  both  a  concept  and  an  operation  which  results 
in  new  concepts).  Thus  AM  should  be  viewed  as  a  mass  of  knowledge  which  enlarges  Itself 
repeatedly.  Typically,  computer  programs  keep  the  Information  they  "discover"  quite 
separate  from  the  knowledge  they  use  to  make  discoveries'* 

Perhaps  the  greatest  difference  between  AM  and  typical  heuristic  search  procedures  is  that 
AM  has  no  well-defined  target  concepts  or  target  relationships.  Rather,  its  "goal  criterion"  - 
its  sole  aim  -  is  to  maximize  the  interestingness  level  of  the  activities  it  performs,  the 
priority  ratings  of  the  top  tasks  on  the  agenda.  It  doesn’t  matter  precisely  which  definitions 
or  conjectures  AM  discovers  -  or  misses  -  so  long  as  it  spends  its  time  on  plausible  tasks. 
There  is  no  fixed  set  of  theorems  that  AM  should  discover,  so  AM  is  not  a  typical  problem- 
solver.  There  is  no  fixed  set  of  traps  AM  should  avoid,  no  small  set  of  legal  moves,  and  no 
winning/losing  behavior,  so  AM  is  not  a  typical  game-player. 

For  example,  no  stigma  is  attached  to  the  fact  that  AM  never  discovered  real  numbers'^;  it 
was  rather  surprising  that  AM  managed  to  discover  natural  numbers!  Even  if  it  hadn’t 
done  that,  it  would  have  been  acceptable'*  if  AM  had  simply  gone  off  and  developed  ideas 
in  set  theory. 


Of  court*  thi*  it  ofttn  btetut*  1h«  two  kind*  of  knowl*d|*  tr*  v*ry  difftrtnt.  For  •  ch«tt-pl«y*r,  th*  firtt  kind  it 
'food  botrd  potitiont,'  tnd  th*  tteond  it  ’*trtt*fi**  for  mtkinf  •  food  mov*.'  Th*or*m-prov*r*  tr*  tn 
*xc*ptionL  Th«y  produc*  *  now  th«or*m,  tnd  th*n  ut*  H  (ilmott  lik*  t  now  oporttor)  in  future  proof*.  A 
profrim  to  lairn  to  pity  chtcktr*  (Stmutl  (7j  h**  thi*  *>m*  flavor,  thoroby  indieitinf  that  thi*  '**lf-h«lp’ 
property  t*  not  a  function  of  th*  taak  domain,  net  aimply  a  charactariatic  of  mathtmatic*. 

' Thor*  ar*  many  "nico"  thinf*  which  AM  didn’t  —  and  can’t  ~  do.  of.,  doviainf  laomotric  concept*  from  H*  initial  •impi* 
**t-th*or*tic  knowlodf*.  S«*  th*  dtacuaaion  of  th*  limitation*  of  AM,  Section  7.2. 

'*  Acceptable  to  whom?  I*  there  really  a  domain-invariant  criterion  for  judfinf  th*  quality  of  AM**  action*^  See  the 
diacueaion*  in  Section  7.1 
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l.S.S.  AM  as  a  Mathematician 


Before  diving  into  the  innards  of  AM,  let’s  take  a  moment  to  discuss  the  totality  of  the 
mathematics  which  AM  carried  out.  Like  a  contemporary  historian  summarizing  the  work 
of  the  Babylonian  mathematicians,  we  shan’t  hesitate  to  use  current  terms  and  criticize  by 
current  standards. 

AM  began  its  investigations  with  scanty  knowledge  of  a  few  set-theoretic  concepts  (sets, 
equality  of  sets,  set  operations).  Most  of  the  obvious  set-theory  relations  (e.g.,  de  Morgan’s 
laws)  were  eventually  uncovered;  since  AM  never  fully  understood  abstract  algebra,  the 
statement  and  verification  of  each  of  these  was  quite  obscure.  AM  never  derived  a  formal 
notion  of  infinity,  but  it  naively  established  conjectures  like  "a  set  can  never  be  a  member  of 
itself",  and  procedures  for  making  chains  of  new  sets  ("insert  a  set  into  Itself").  No 
sophisticated  set  theory  (e.g.,  diagonalization)  was  ever  done. 

After  this  initial  period  of  exploration,  AM  decided  that  "equality"  was  worth  generalizing, 
and  thereby  discovered  the  relation  "same-sizc-as".  "Natural  numbers"  were  based  on  this, 
and  soon  most  simple  arithmetic  operations  were  defined. 

Since  addition  arose  as  an  analog  to  union,  and  multiplication  as  a  repeated  substitution 
followed  by  a  generalized  kind  of  unioning*^  it  came  as  quite  a  surprise  when  AM  noticed 
that  they  were  related  (namely,  N+N-2xN).  AM  later  re-discovered  multiplication  in  three 
other  ways:  as  repeated  addition,  as  the  numeric  analog  of  the  Cartesian  product  of  sets, 
and  by  studying  the  cardinality  of  power  sets^.  These  operations  were  defined  in  different 
ways,  so  it  was  an  unexpected  (to  AM)  discovery  when  they  all  turned  out  to  be  equivalent. 
These  surprises  caused  AM  to  give  the  concept  Times’  quite  a  high  Worth  rating. 

Exponentiation  was  defined  as  repeated  multiplication.  Unfortunately,  AM  never  found  any 
obvious  properties  of  exponentiation,  hence  lost  all  interest  in  it. 

Soon  after  defining  multiplication,  AM  investigated  the  process  of  multiplying  a  number  by 
itself:  squaring.  The  inverse  of  this  turned  out  to  be  Interesting,  arid  led  to  the  definition  of 
square-root.  AM  remained  content  to  play  around  with  the  concept  of  fnfeger-square-root. 
Although  it  isolated  the  set  of  numbers  which  had  no  square  root,  AM  was  never  close  to 
discovering  rationals,  let  alone  irrationals. 

Raising  to  fourth -powers,  and  fourth-rooting,  were  discovered  at  this  time.  Perfect  squares 
and  perfect  fourth-powers  were  isolated.  Many  other  numeric  operations  and  kinds  of 
numbers  were  isolated:  Odds,  Evens,  Doubling,  Halving,  etc.  Primitive  notions  of  numeric 
Inequality  were  defined  but  AM  never  even  discovered  Trichotomy. 

The  associativity  and  commutativity  of  multiplication  indicated  that  it  could  accept  a  BAG 


1 9 

Ttkt  two  b«gt  A  and  B.  Raplaca  «aeh  alamant  of  A  by  tht  bag  B.  Ramovt  on*  ttval  of  par*n1h*t**  by  taking  th*  union  of 
ai!  slamanta  of  tha  tranafigurad  bag  A.  Than  that  naw  bag  will  hav*  at  many  alamani*  a*  th*  product  of  th* 
langth*  of  th*  two  original  bag*. 

20  S 

Th*  aiz*  of  th*  aat  of  all  aubaat*  of  S  I*  2  .  Thu*  th*  powar  tat  of  AUB  hat  langth  aqual  to  th*  product  of  th*  langth* 
of  th*  powar  aatt  of  A  and  B  individually  (atauming  A  and  B  ar*  dit^oint). 
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of  numbers  as  its  argument.  When  AM  defined  the  inverse  operation  corresponding  to 
Times,  this  property  allowed  the  definition  to  be:  "any  bag  of  numbers  (>1)  whose  product  is 
x".  This  was  just  the  notion  of  factoring  a  number  x.  Minimally-factorable  numbers 
turned  out  to  be  what  we  call  primes.  Maximally-factorable  numbers  were  also  thought  to 
be  interesting. 

Prime  pairs  were  discovered  in  a  bizarre  way:  by  restricting  addition  (its  arguments  and  its 
values)  to  Primes.*’  AM  conjectured  the  fundamental  theorem  of  arith.metic  (unique 
factorization  into  primes)  and  Goldbach’s  conjecture  (every  even  number  >2  is  the  sum  of 
two  primes)  in  a  surprisingly  symmetric  way.  The  unary  representation  of  numbers  gave 
way  to  a  representation  as  a  bag  of  primes  (based  on  unique  factorization),  but  AM  never 
thought  of  exponential  notation.  **  Since  the  key  concepts  of  remainder,  greater-than,  gcd, 
and  exponentiation  were  never  mastered,  progress  in  number  theory  was  arrested. 

When  a  new  base  of  geometric  concepts  was  added,  AM  began  finding  some  more  general 
associations.  In  place  of  the  strict  definitions  for  the  equality  of  lines,  angles,  and  triangles, 
came  new  definitions  of  concepts  we  refer  to  as  Parallel,  Equal-measure,  Similar,  Congruent, 
Translation,  Rotation,  plus  many  which  have  no  common  name  (e.g.  the  relationship  of  two 
triangles  sharing  a  common  angle).  A  cute  geometric  interpretation  of  Goldbach’s 
conjecture  was  found**.  Lacking  a  geometry  "model"  (an  analogic  representation  like  the 
one  Gelernter  employed),  AM  was  doomed  to  failure  with  respect  to  proposing  only 
plausible  geometric  conjectures. 

Similar  restrictions  due  to  poor  "visualization"  abilities  would  crop  up  in  topology.  The 
concepts  of  continuity.  Infinity,  and  measure  would  have  to  be  fed  to  AM  before  it  could 
enter  the  domains  of  analysis.  More  and  more  drastic  changes  in  its  initial  base  would  be 
required,  as  the  desired  domain  gets  further  and  further  from  simple  finite  set  theory  and 
elementary  number  theory. 


21 

Th«t  m,  con«id«r  th«  tit  of  triplit  p,q,r,  ill  prittnt,  for  which  p«q>r.  Thin  om  of  thiin  mutt  bi  *2*,  tnd  thi  othir  two 
mutt  thirifori  form  i  primi  piir. 

22 

A  tcnfintiil  noti-  All  of  thi  ditcoviriii  mintionid  ibovi  win  mtdi  by  AM  workinf  by  itiilf,  with  •  human  biinc 
obtarvinf  Hi  bihtvior.  If  thi  livil  of  lophiiticttion  of  AM'i  concaptt  win  hifhir  (or  thi  Mvil  of 
aophiiticition  of  Ht  utari  wan  lowir),  than  it  mifht  bi  worthwhila  to  davalop  a  nka  uaar— ayatam 
intarfaca.  Tha  utar  in  that  cata  could  -•  and  ou|ht  to  —  work  rifht  alonf  with  AM  at  a  eo-rataarchir. 

**  Givan  all  anflaa  of  a  prima  numbar  of  da(raat,  (0,1, 2,3, 5, 7,)  I ,..,179  dainat),  than  any  anfla  batwaan  0  and  ISO 
dafraaa  can  ba  approxhnatad  (to  within  I  dafna)  at  tha  turn  of  two  of  thoaa  anflaa. 
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1.3.4.  AM  as  «  Thesis  foptionall 


Walking  home  along  a  deserted  street  late  at  night,  the  reader  may  imagine 
himself  to  feel  in  the  small  of  his  back  a  cold,  hard  object;  and  to  hear  the  words 
spoken  behind  him,  'Easy  now.  This  is  a  stick-up.  Hand  over  your  money.'  What 
does  the  reader  do7  He  attempts  to  generate  the  utterance.  He  says  to  himself, 
now  if  /  were  standing  behind  someone  holding  a  cold,  hard  object  against  his 
back,  what  would  make  me  say  thatf  What  would  I  mean  by  it?  The  reader  is 
advised  tnct  he  can  only  arrive  at  the  deep  structure  of  this  b^k,  and  through  the 
deep  structure  the  semantics,  if  he  attempts  to  generate  the  book  for  himself.  T he 
author  wLhes  Aim  luck. 


“  Lincltrholm 


Don’t  be  scared  by  the  weight  of  the  document  you’re  now  holding.  If  you  flip  to  page  165, 
you’ll  see  that  the  last  two-thirds  are  just  appendices. 

Each  chapter  is  of  roughly  equal  importance,  which  explains  the  huge  variation  in  length. 
Start  looking  over  Chapter  2  right  away:  it  contains  a  detailed  example  of  what  AM  does. 
Since  you’re  reading  this  sentence  now,  we’ll  assume  that  you  want  a  preview  of  what’s  to 
come  in  the  rest  of  this  document. 

Chapter  3  covers  the  top-level  control  structure  of  the  system,  which  is  based  around  the 
notion  of  an  ‘agenda’  of  tasks  to  perform.  In  Chapter  4  the  low-level  control  structure  is 
revealed:  AM  is  really  guided  by  a  mass  of  heuristic  rules  of  varying  generality.  Chapter  5 
contains  more  than  you  want  to  know  about  the  representation  of  knowledge  In  AM.  The 
diagram  showing  some  of  AM’s  starting  concepts  (page  105)  is  worth  a  look,  even  out  of 
context. 

Most  of  the  results  of  the  project  are  presented  in  Chapter  6.  In  addition  to  simply  ‘running’ 
AM,  several  experiments  have  been  conducted  with  It.  It’s  av'kward  to  evaluate  AM,  and 
therefore  Chapter  7  is  quite  long  and  detailed. 

The  appendices  provide  material  which  supplements  the  text.  Appendix  2  contains  a 
description  of  ail  the  initial  concepts,  some  examples  of  how  they  were  coded  into  Lisp,  and 
a  partial  list  of  the  concepts  AM  defined  and  investigated  along  the  way.  Appendix  3 
exhibits  ail  242  heuristics  that  AM  is  explicitly  provided  Mth.  Appendix  4  is  essentially  a 
math  article,  about  the  major  discovery  that  AM  motivated:  rnaximally-divisible  numbers. 
Finally,  Appendix  5  contains  traces  of  AM  in  action;  a  long  prose  description,  a  long  task- 
by-task  description,  and  a  long  undoctored  transcript  excerpt.  Appendix  1  hasn’t  been 
mentioned  yet,  and  forms  the  subject  of  the  remainder  of  this  section. 

This  thesis  -  and  its  readers  -  must  come  to  grips  with  a  very  interdisciplinary  problem. 
For  the  reader  whose  background  is  in  Artificial  Intelligence,  most  of  the  system’s  actions  - 
the  "mathematics"  it  does  -  may  seem  inherently  uninteresting.  For  the  mathematician,  the 
word  "LISP"  signifies  nothing  beyond  a  speech  Impediment  (to  Artificial  Intelligence  types  it 
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also  connotes  a  programming  impediment).  If  I  don’t  describe  "LISP"  the  first  time  I 
mention  it,  a  large  fraction  of  potential  readers  will  never  realize  that  potential.  If  I  do  step 
to  describe  LISP,  the  other  readers  will  be  bored. 

In  an  attempt  not  to  lose  readers  due  to  jargon,  two  glossaries  of  terms  have  been  compiled. 
Appendix  1.1  (p.  165)  contains  capsule  descriptions  of  the  mathematical  terms,  ideas,  and 
notations  used  in  this  thesis.  Appendix  1.2  renders  the  analogous  service  for  .'<,rtificial 
Intelligence  jargon  and  computer  science  concepts. 


Chapter  2.  An  Example:  Discovering  Prime  Numbers 

Tir''  '  Ii"'-— I 


This  chapter  wtll  present  an  example  of  AM  in  action,  an  excerpt  from  the  output  of  AM, 
as  it  investigates  seme  concepts. 

After  a  brief  discussion  of  AM's  control  structure  in  Section  2.1,  the  reader  will  be  told 
what  the  point  of  this  example  is  -  and  is  not.  Section  2.3  provides  a  few  eleventh-hour 
hints  at  decoding  the  example. 

The  excerpt  itself  follows  in  Section  2.4.  It  skips  the  first  half  of  the  session,  and  picks  up 
at  a  point  just  after  AM  has  defined  the  concept  "Divisors-of".  Soon  afterward,  AM  defines 
Primes,  and  begins  to  find  interesting  conjectures  related  to  them.  The  excerpt  goes  on  to 
show  how  AM  conjectured  the  fundamental  theorem  of  arithmetic  and  Goldbach’s 
conjecture.  AM  derived  the  notion  of  partitioning  a  collection  of  n  objects  into  smaller 
bundles,  but  failed  to  find  any  interesting  conjectures  about  that  process.  Instead,  AM  was 
side-tracked  into  the  (probably)  fruitless  investigation  of  numbers  which  can  be  represented 
as  the  sum  of  two  primes  in  one  unique  way. 

The  finM  section  of  this  chapter  will  recap  this  example  the  way  a  math  historian  might 
report  it. 


2.1.  Discussion  of  the  AM  Program 


2.1.1.  Representation 

AM  is  a  program  which  expands  a  knowledge  base  of  mathematical  concepts.  Each  concept 
is  stored  as  a  particular  kind  of  data  structure,  namely  as  a  collection  of  properties  or 
"facets"  of  the  concept.  For  example,  here  is  a  miniature  example  of  a  concept’: 


Th«  rifM  arrow  {’■*“)  in  th«  box  on  tht  n«xt  p(|«  i*  th«  symbol  for  "implits*.  ’Not.*  it  tn  tbbrtvittion  for  ’Numbtrt*.  The 
vtrtkil  btr  'I*  h,  t  tymbol  for  tht  prt(f<:.ttt  'dividtt  tvtniy  into*  tht  hook  *-*  it  «  symbol  for  the  prtdktte 
*tht  ntgitien  of*.  *«*  indkattt  txclottvt  or,  and  tht  tymbol  *V*  it  raid  ’for  all*.  PItatt  consult  tht 
llotttry,  Apptndhc  1.1,  for  fullar  ditevttion  of  thtta,  plus  othar  math  tarmt  ll(t  ’Primo  pairs*. 


r- 


. o  ‘  ~  1 
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NAME:  Prim*  Numbars 
{DEFINITIONS: 

ORIGIN:  Numb«r-of‘'divlsorfof(x)  ■  2 
PREDICATE-CALCULUS:  Prim*(x)  ■  (Vz)(2|x  -♦  2»l  •  i«x) 

ITERATIVE:  (for  x>l):  For  I  from  2  to  SqrKx),  -(l|x) 

EXAMPLES:  2,  3,  5,  7,  1 1,  13,  17 
BOUNDARY:  2,  3 
BOUNDARY-FAILURES:  0,  1 
FAILURES:  12 

GENERALIZATIONS:  Nos.,  Nos.  with  an  ovan  nc.  of  divisors.  Nos.  with  a  prim*  no.  of  divisor* 

SPECIALIZATIONS:  Odd  Prim*s,  Prim*  Pairs,  Prim*  Uniqu*ly-addsbl*s 

CONJECS:  Uniqu*  factorization,  GoldbMh's  eonjoctur*,  Extr*m*s  of  Numb*r-of-divisort-of 

INTU'S:  j1  metaphor  to  the  effect  that  Primes  are  the  building  blocks  of  all  numbers 

ANALOGIES: 

Maximally-divisibI*  numbors  ar*  convors*  *xtr*m*s  of  Numb*r-of-divi*ori»of 
Factor  •  non-simpi*  group  into  timpl*  groups 

INTEREST:  Conj*ctur*s  tying  Primos  to  TIMES,  to  Divisor*-of,  to  closoly  r*lst*d  operations 
WORTH:  800 


"Creating  a  new  concept"  is  a  well-deilned  activity:  it  involves  setting  up  a  new  data 
structure  like  the  one  above,  and  filling  in  entries  for  some  of  Its  facets  or  slots.  Filling  in  a 
particular  facet  of  a  particular  concept  is  also  quite  well-defined,  and  is  accomplished  by 
executing  a  collection  of  relevant  heuristic  rules.  This  process  will  be  described  in  great 
detail  in  later  chapters. 


Agenda  and  Heuristics 


An  agenda  of  plausible  tasks  is  maintained  by  AM.  A  typical  task  is  "Fill-in  *xampt*s  of 
Prim**".  The  agenda  may  contain  hundreds  of  entries  such  as  this  one.  AM  repeatedly 
selects  the  top  task  from  the  agenda  and  tries  to  carry  It  out.  This  is  the  whole  control 
structure!  Of  course,  we  must  stilt  explain  how  AM  creates  plausible  new  tasks  to  place  on 
the  agenda,  how  AM  decides  which  task  will  be  the  best  one  to  execute  next,  and  how  it 
carries  out  a  task. 
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If  the  task  Is  "Fill  in  new  Algorithm*  for  Set-union",  then  satisfying  It  would  mean  actually 
synthesizing  some  new  procedures,  some  new  LISP  code  capable  of  forming  the  union  of 
any  two  sets.  A  heuristic  rule  is  relevant  to  a  task  iff  executing  that  rule  brings  AM  closer 
to  satisfying  that  task.  Relevance  is  determined  a  priori  by  where  the  rule  is  stored.  A  rule 
tacked  onto  the  Domain/range  facet  of  the  Compose  concept  would  be  presumed  relevant  to 
the  task  "Chock  ih«  Domiin/rango  of  IncortoDoloto". 

Once  a  task  is  chosen  from  the  agenda,  AM  gathers  some  heuristic  rules  which  might  be 
relevant  to  satisfying  that  task.  They  are  executed,  and  then  AM  picks  a  new  task.  While 
a  rule  is  executing,  three  kinds  of  actions  or  effects  can  occur: 

(i)  Facets  of  some  concepts  can  get  filled  in  (e.g.,  examples  of  primes  may  actually  be  found 

ar.d  ta:ked  onto  the  "Examples"  facet  of  the  "Primes"  concept).  A  typical  heuristic  rule 
which  might  have  this  effect  is: 

To  fill  in  oxaniplos  of  X,  whoro  X  is  •  kind  of  Y  (for  tom*  moro  gonoral  concopt  Y), 
Chock  tho  oxsmplot  of  Y;  somo  of  thorn  may  bo  oxamplos  of  X  at  woll. 

For  the  task  of  filling  in  examples  of  Primes,  this  rule  would  have  AM  notice  that 
Primes  is  a  kind  of  Number,  and  therefore  look  over  all  the  known  examples  of 
Number.  Some  of  those  would  be  primes,  and  would  be  transferred  to  the  Examples 
facet  of  Primes. 

(ii)  New  concepts  may  be  created  (e.g.,  the  concept  "primes  which  are  uniquely  representable 

as  the  sum  of  two  other  primes"  may  be  somehow  be  deemed  worth  studying).  A 
typical  heuristic  rule  which  might  result  in  this  new  concept  is: 

If  seme  (but  not  most)  txamplas  of  X  are  also  oxampits  of  Y  (for  some  concept  Y), 
Create  a  new  concept  defined  as  the  intersection  of  those  2  concepts  (X  and  Y). 

Suppo.se  AM  has  already  isolated  the  concept  of  being  representable  as  the  sum  of  two 
primes  in  only  one  way  (AM  actually  calls  such  numbers  "Uniquely-prime-addabte 
numbers").  W.’ien  AM  notices  that  some  primes  are  in  this  set,  the  above  rule  will 
create  a  brand  new  concept,  defined  as  the  set  of  numbers  which  are  both  prime  and 
uniquely  prime  addable. 

(ill)  New  tasks  may  be  adder!  to  the  agenda  (e.g.,  the  current  activity  may  suggest  that  the 
following  task  is  worth  considering:  "Generalize  the  concept  of  prime  numbers").  A 
typical  heuristic  rule  which  might  have  this  effect  Is: 

If  vory  f«w  oxamplos  of  X  iro  found, 

Then  odd  th*  following  task  to  tho  agonda:  "Genoralizo  tho  concept  X". 

Of  course,  AM  contains  a  precise  meaning  for  the  phrase  "very  few".  When  AM  looks 
for  primes  among  examples  of  already-known  kinds  of  numbers,  it  will  find  dozens  of 
non-examples  for  every  example  of  a  prime  it  uncovers.  "Very  few"  is  thus  naturally 
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implcmented  as  a  Uatistical  confidence  level^. 

The  concept  of  an  agetss'-a  is  certainly  not  new:  schedulers  have  been  around  for  a  long 
time.  But  one  importas/  feature  of  AM’s  agenda  scheme  is  a  new  idea:  attaching  -  and 
using  -  a  list  of  quaisi-symbolic^  reasons  to  each  task  which  explain  why  the  task  is  worth 
considering,  why  it’s  plausible.  It  Is  the  responsibility  of  the  heuristic  rules  to  include  reasons 
for  any  tasks  they  propose.^  For  example,  let's  reconsider  the  heuristic  rule  mentioned  in  (lii) 
above.  It  really  looks  more  like  the  following: 

If  very  few  examples  of  X  are  found, 

Then  add  the  followinf  task  to  the  agenda:  "Generalize  the  concept  X’',  for  the  following 
reason:  "X’s  are  quite  rare;  e  slightly  less  restrictive  concept  might  be  more 
interesting". 


If  the  same  task  is  proposed  by  several  rules,  then  several  different  reasons  for  it  may  be 
present.  In  addition,  one  ephemeral  reason  also  exists:  "Focus  of  attention".  Any  tasks 
which  are  similar  to  the  one  last  executed  get  "Focus  of  attention"  as  a  bonus  reason.  AM 
uses  all  these  reasons,  e.g.  to  decide  how  to  rank  the  tasks  on  the  agenda.  The 
"intelligence"  AM  exhibits  is  not  so  much  "what  it  does",  but  rather  the  order  in  which  it 
arranges  its  agenda®.  AM  uses  the  list  of  reasons  In  another  way:  Once  a  task  has  been 
selected,  the  quality  of  the  reasons  is  used  to  decide  how  much  time  and  space  the  task  will 
be  permitted  to  absorb,  before  AM  quits  and  moves  on  to  a  new  task.  This  whole 
mechanism  will  be  detailed  in  Section  3.3.2,  on  Page  33. 


2.2.  What  to  get  out  of  —  and  NOT  gret  out  of  —  this  example 

The  purpose  of  the  example  which  begins  on  page  20  is  to  convey  a  bit  of  AM’s  flavor. 
After  reading  through  It,  the  reader  should  be  convinced  that  AM  is  not  a  theorem-prover, 
nor  is  it  randomly  manipulating  entries  in  a  knowledge  base,  nor  is  it  exhaustively 
manipulating  or  searching.  AM  is  carefully  growing  a  network  of  data  structures 
representing  mathematical  concepts,  by  repeatedly  using  heuristics  both  (a)  for  guidance  in 
choosing  a  task  to  work  on  next,  and  (b)  to  provide  methods  to  satisfy  the  chosen  task. 


2 

Th«  ratio  of  txamplot  found  to  non-aiamplaa  atumblad  ovar  Not  batwaan  .00!  and  .05.  Philoaophara  outra|ad  by  thia  may 
ba  aomawhat  appaatad  by  knowhdia  that  larga  chanfaa  in  tha  pracita  numbara  vary  raraly  altar  AM'a 
bahavior 

3 

Each  raaaon  la  an  Enfliah  aantanca.  Whila  AM  can  tall  whathar  two  |ivan  raaaoni  coincida,  it  can't  actually  do  any  intarnal 
procattinf  on  tham.  If  thia  lack  of  intalliganca  had  provad  to  ba  a  limiting  problam,  than  mora  work  would 
hava  baan  axptndad  on  giving  AM  aoma  auch  abilitiat. 

4 

An  altarnativa  achama,  parhapa  avan  a  bit  mora  human-lika,  would  ba  to  (parhapa  only  occaaionally)  allow  a  burat  of 
poorly-motivatad  taaka  to  ba  propoatd,  and  than  uaa  aoma  pruning  critaria  to  wood  out  tha  obvioua  loaara. 
During  thia  tima,  AM  could  typa  out  to  tha  uaar  (who  otharwiaa  would  ba  cloaaly  monitoring  ita  aetivitiaa}  a 
cuta  anthropomorphK  phraaa  lika  Tm  now  aitting  back  and  puffing  on  my  pipa,  loat  in  contamplation.'' 

®  For  axampla,  altarnating  a  randomly-chocan  taak  and  tha  Traat"  taak  (tha  ona  AM  choaa  to  do)  only  abwa  tha  ayatam 
down  by  a  factor  of  2,  yat  H  totally  daatroya  Ha  cradibility  aa  a  rational  raaaarchar  (aa  judgad  by  tho 
human  uaar  of  AM).  Thia  ia  ona  cencluaion  of  axparimant  2  (aaa  Saction  6.2.2,  paga  129). 
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The  following  points  are  important  but  can’t  be  conveyed  by  any  lone  example: 

(i)  Although  AM  appears  to  have  reasonable  natural  language  abilities,  this  is  a  typical  AI 

illusion:  most  of  the  phrases  AM  types  are  mere  tokens,  and  the  syntax  which  the  user 
must  obey  Is  unnaturally  constrained.  For  the  sake  of  clarity,  I  have  "touched  up"  some 
of  the  wording,  indentation,  syntax,  etc.  of  what  AM  actually  outputs,  but  left  the  spirit 
of  each  phrase  intact.  As  the  reader  becomes  more  familiar  with  AM,  future  examples 
can  be  unretouched".  If  he  wishes,  he  may  glance  at  Appendix  5.3,  which  shows 
some  actual  listings  of  AM  in  action. 

(ii)  The  reader  should  be  skeptical  of  the  generality  of  the  program;  is  the  knowledge  base 

"just  right"  (i.e.,  finely  tuned  to  elicit  this  one  chain  of  behaviors)?  The  answer  is 
"A/o"®.  The  whole  point  of  this  project  is  to  show  that  a  relatively  small  set  of  general 
heuristics  can  guide  a  nontrivial  discovery  process.  Each  activity,  each  task,  was 
proposed  by  some  heuristic  rule  (like  "look  for  extreme  cases  of  X")  which  was  used 
time  and  time  again,  in  many  situations.  It  was  not  considered  fair  to  insert  heuristic 
guidance  which  could  only  "guide"  in  a  single  situation. 

This  kind  of  generality  can’t  be  shown  convincingly  in  one  example.  Nevertheless, 
even  within  this  small  excerpt,  the  same  line  of  development  which  leads  to 
decomposing  numbers  (using  TIMES*')  and  thereby  discovering  unique  factorization, 
also  leads  to  decomposing  numbers  (using  ADD*')  and  thereby  discovering  Goldbach’s 
conjecture.  The  same  heuristic  which  caused  AM  to  expect  that  unique  factorization 
will  be  useful,  also  caused  AM  to  suspect  that  Goldbach’s  conjecture  will  be  useless. 

Let  me  reemphasize  that  the  "point"  of  this  example  is  not  the  specific  mathematical 
concepts,  nor  the  particular  chains  of  plausible  reasoning  AM  produces,  nor  the  few  flashy 
conjectures  AM  spouts,  but  rather  an  illustration  of  the  kinds  of  things  AM  does. 


2.3.  Decipberingr  the  Example 

Recall  that  in  general,  each  task  on  the  agenda  will  have  several  reasons  attached  to  it.  In 
the  example  excerpt,  the  reasons  for  each  task  are  printed  just  after  the  task  is  chosen,  and 
before  It’s  executed. 

AM  numbers  its  activities  sequentialiy.  Each  time  a  new  task  is  chosen,  a  counter  is 
incremented.  The  first  task  in  the  example  excerpt  Is  labelled  **  TASK  65  •»,  meaning  that 
the  example  skips  the  first  64  tasks  which  AM  selects  and  carries  out.  The  reason  simply  is 
that  the  development  of  sample  concepts  related  to  divisibility  will  probably  be  more 
intelligible  and  palatable  to  the  reader,  than  AM’s  early  ramblings  in  finite  set  theory. 


'  Tha  dc»i»n  of  AM  wm  finoly  tunod  (O  that  tht  imwar  to  Ihi*  quaation  would  bo  'No*.  Pondar  that  ona! 
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In  the  example  itself,  several  irrelevant  tasks  have  been  excised^  About  half  of  those 
omitted  tasks  were  interesting  in  themselves,  but  all  of  them  were  tangential  or  unrelated  to 
the  development  shown.  The  reader  can  tell  by  the  global  task  numbering  how  many  were 
skipped.  For  example,  notice  that  the  excerpt  jumps  from  Task  67  to  Task  79. 

To  help  gauge  AM’s  abilities,  the  reader  may  be  interested  to  know  that  AM  defined 
"Natural  Numbers"  during  Task  ■44,  and  "TIMES"  was  defined  during  Task  57.  AM 
started  with  no  knowledge  of  numbers,  and  only  scanty  knowledge  of  sets  and  set-operations. 
Task  3,  e.g.,  was  to  fill  in  examples  of  Sets. 

The  concepts  that  AM  talks  about  are  self-explanatory  -  by  and  large.  Below  are  discussed 
some  nonstandard  ones. 

BAG  is  a  kind  of  list  structure,  a  bunch  of  elements  which  are  unordered,  but  one  in  which 
multiple  copies  of  the  same  element  are  permitted.  One  may  visualize  a  paper  bag  filled 
with  cardboard  letters.  Technically,  we  shall  say  that  a  set  is  not  considered  to  be  a  bag.  A 
bag  is  denoted  by  enclosure  within  parentheses,  just  as  sets  are  within  braces.  So  the  bag 
containing  X  and  four  Y’s  might  be  written  (X  Y  Y  Y  Y),  and  would  be  considered 
indistinguishable  from  the  bag  (Y  Y  Y  X  Y). 

Number  will  mean  (typically)  a  positive  integer. 

TIMES'^  is  a  particular  relation.  For  any  number  x,  TIMES’ '(x)  is  a  set  of  bags.  Each 
bag  contains  some  numbers  which,  when  multiplied  together,  equal  x.  For  example, 
TIMES-'(18)  -  {  (18)  (2  9)  (2  3  3)  (3  6)  }.  Checking,  we  see  that  multiplying,  e.g.,  the 
numbers  in  the  bag  (2  3  3)  together,  we  do  get  2x2x3»l8.  TIMES* ‘(x)  contains  all  possible 
such  bags  (containing  natural  numbers  >1). 

/4DD'*  Is  a  relation  analogous  to  TIMES*'.  For  any  number  x,  ADD*'(x)  is  also  a  set  of 
bags.  Each  bag  contains  a  bunch  of  numbers  which,  when  added  together,  equal  x.  For 
example,  ADD*'(4)  ■  { (4)  (I  I  1  I)  (1  1  2)  (1  3)  (2  2) }.  ADD*'(x)  contains  all  possible  such 
bags  (containing  numbers  >0);  it  finds  all  possible  partitions  of  x. 

Divisors-of  is  a  more  standard  relation.  For  any  number  x,  Divi$ors-of(x)  is  the  set  of  all 
positive  numbers  which  divide  evenly  into  x.  For  example,  Divisors-of(l8)  -  {I  2  3  6  9  18}. 

The  definitions  for  most  of  the  mathematical  terms  used  in  the  excerpt  can  be  found  in  the 
Glossary  (Appendix  1.1).  Whenever  there  is  a  conflict  between  "computer  science  jargon" 
and  "math  jargon",  I  have  opted  for  the  latter.  So,  e.g.,  all  "functions"  are  necessarily  single¬ 
valued  for  each  member  of  their  domain. 

AM  is  an  interactive  computer  program.  It  prints  out  phrases  indicating  what  it  is  doing  at 
each  moment,  and  a  human  being,  referred  to  as  the  User,  watches  AM’s  activities.  At  any 
moment,  the  user  can  interrupt  AM  to  ask  questions  or  re-direct  its  energies.  In  the  excerpt, 
the  user  never  re-directs  AM,  although  he  does  occasionally  ask  AM  to  rename  some 
concept  and  to  explain  its  behavior  (e.g.,  just  after  *»  TASK  106  »»).  For  the  benefit  of  the 
reader,  I  have  bold-faced  all  the  user’s  comments. 

^  Thit  i*  f*ir,  d*«pit«  th«  of  Exporimont  2  («m  Soction  6.22)  b«c*u«t  th«  romointnf  t*tk«  clump  toc«th«r  in 

two*,  thr**«,  (tci  th«y  ir*  unint*rrvpt*d  Imt*  of  rttMrch  (*t.,  T(«k«  65-67),  o*p«r*t«d  by  vory  tore* 
lap*  (of.,  tlw  jump  from  Tt«k  67  to  79). 
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My  pe'  sonal  notes  and  comments  occur  right- justified,  in  italics. 


2.4.  The  Example  Itself 


♦♦  Task  65t  ** 

Filling  in  examples  of  the  following  concept:  "Divisors-of". 

3  Reasons:  (1)  No  known  examples  for  Divisors-of  so  far. 

(2)  TIMES,  which  is  related  to  Divisors-of,  is  now  very  interesting. 

(3)  Focus  of  attention:  AM  recently  defined  Divisors-of. 


26  examples  found,  in  9.2  seconds,  e.g.:  Divisors-of(6)-{l  2  3  6}. 
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»»  TwK  66;  »» 

Considering  numbers  which  have  very  small  sets  of  Divisors-of  (i.e.,  very  few  divisors). 

2  Reasons:  (1)  Worthwhile  to  look  for  extreme  cases. 

(2)  Focus  of  attention:  AM  recently  worked  on  Divisors-of. 

Filling  in  examples  of  numbers  with  0  divisors. 

0  examples  found,  In  4.0  seconds. 

Conjecture:  no  numbers  have  0  divisors. 

Filling  in  examples  of  numbers  with  1  divisors. 

I  examples  found,  in  4.0  seconds,  e.g.:  Divisors-of{l  )■{!}. 

Conjecture:  1  is  the  only  number  with  1  divisor. 

Filling  in  examples  of  numbers  with  2  divisors. 

24  examples  found,  in  4.0  seconds,  e.g.:  Oivisors-of(13)B{l  13}. 

No  obvious  conjecture.  This  kind  of  number  merits  more  study. 

Creating  a  new  concept:  "Numbers-with-2-divisors". 

yIM  had  previously  derived  the  notion  of  singleton, 
doubleton,  etc.  Above,  AM  was  actually  defining,  e.g.,  the 
set  of  numbers  which  gets  mapped  into  a  doubleton  of 
numbers,  by  the  operation  Divisors-of.  /  took  the  liberty 
of  translating  this  into  the  above  form,  which  uses,  e.g., 
the  number  "2". 

Filling  in  examples  of  numbers  with  3  divisors. 

II  examples  found,  in  4.0  seconds.  e.g.:  Divisors-of{49)-{l  7  49). 

All  numbers  with  3  divisors  ire  also  Squares.  This  kind  of  number  merits  more  study. 

Creating  a  new  concept:  "Numbers-with-3-divisors". 

How  did  AM  notice  that  conjecture?  It  took  a  random 
example  of  Numbers-with-B-divisiors,  say  49.  Then  it 
asked  what  other  known  concepts  *49"'  was  an  example 
of.  The  two  answers  were:  Odd-numbers  and  Perfect- 
squares.  AM  then  tested  these  conjectures  on  the  other 
ten  examples  just  found.  T he  only  surviving  conjecture 
was  that  all  numbers-with-S-dlvisors  are  also  perfect- 
squares. 
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*»  TisK  67:  ** 

Considering  the  square-roots  of  Numbers-with-3-divisors. 

2  Reasons:  (1)  Numbers-with-3-div"sors  are  unexpectedly  also  perfect  Squares. 

(2)  Focus  of  attention:  AM  recently  worked  on  Numbers-with-3-divisors. 

All  square-roots  of  Numbers-with-3-divisors  seem  to  be  N'jmbers-with-2-divisors. 
e.g.,  Oivisors-of(  Square-root(169) )  -  Divisors-of(13)  «  {i  13}. 

Formulating  the  converse  to  this  statement.  Empirically,  it  seems  to  be  true. 

The  square  of  each  Number-v'ith-2-divisors  seems  to  be  a  Number-with-3-divisors. 

This  is  very  unusual.  It  is  not  plausibly  a  coincidence.  (Chance  of  coincidence  is  <  .001) 

Boosting  interestingness  factor  of  the  concepts  involved: 

Interestingness  factor  of  "Divisors-of"  raised  from  300  to  AOO. 

Interestingness  factor  of  "Numbers-with-2-divisors"  raised  from  100  to  600. 
Interestingness  factor  of  "Numbers-with-O-divisors"  raised  from  200  to  700. 


USER:  Call  the  set  of  numbers  with  2  divisors  "Primes". 


»»  Task  68;  »* 

Considering  the  squares  of  Numbers-with-3-divisors. 

2  Reasons:  (1)  Squares  of  Numbers-with-2-divisors  were  interesting. 

(2)  Focus  of  attention:  AM  recently  worked  on  Numbers-with-3-divisors. 


This  gap  in  the  sequencing  —  from  task  67  to  task  79  — 
eliminates  some  tangential  and  boring  tasks.  See  page 
19  for  an  explanation. 
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**  TmK  79t  ** 

Examining  TIMES* '(x),  looking  for  patterns  involving  its  values. 

2  Reasons:  (1)  TIMES*’  is  related  to  the  newly-interesting  concept  "Divisors-of 
(2)  Many  examples  of  TIMES*’  are  known,  to  induce  from. 

Looking  specifically  at  TIMES*’{12),  which  is  { (12)  (2  6)  (2  2  3)  (3  4) }. 

13  conjectures  proposed,  after  2.0  seconds. 

e.g.,  ''TIMES*’(x)  always  contains  a  bag  containing  only  even  numbers". 

Testing  the  conjectures  on  other  examples  of  TIMES*’. 

5  false  conjectures  deal  with  even  numbers. 

AM  will  sometime  consider  the  restriction  of  TIMES*’  to  even  numbers. 

Only  2  out  of  the  13  conjectures  are  verified  for  all  26  known  examples  of  TIMES*’: 

Conjecture  1:  TIMES'’(x)  always  contains  a  singleton  bag. 
e.g.,  TIMES*’(12),  which  is  { (12)  (2  6)  (2  2  3)  (3  4) },  contains  (12). 
e.g.,  TIMES*’(13),  which  is  { (13) ),  contains  (13). 

Creating  a  new  concept,  "Single-times". 

Single-times  is  a  relation  from  Numbers  to  Bags-of-numbers. 

Single-times(x)  is  all  bags  in  TIMES*’(x)  which  are  singletons, 
e.g.,  Single-times(12)"{  (12) }. 
e.g.,  Single-times(13H  (13) }. 

Conjecture  2:  TIMES* ’(x)  always  contains  a  bag  containing  only  primes. 
e.g.,  TIMES*’(12),  which  Is  { (12)  (2  6)  (2  2  3)  (3  4) ),  contains  (2  2  3). 
e.g.,  TIMES*’(13),  which  is  { (13) },  contains  (13). 

Creating  a  new  concept,  "Prime-times". 

Prime-times  is  a  relation  from  Numbers  to  Bags-of-numbers. 

Prime-times(x)  is  all  bags  in  TIMES* ’(x)  which  contain  only  primes, 
e.g.,  Prime-times(12)»{  (2  3  3) }. 
e.g.,  Prime-timos(13)-{  U3) }. 
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**  TwK  80;  **  “ 

Considering  the  concept  "Prime-times".  h 


2  Reasons!  (1)  Conjees  about  Prime-times  will  tell  much  about  Primes  and  TIMES'*.  h 

(2)  Focus  of  attention:  AM  recently  defined  Prime-times.  ^ 

r , 

Looking  specifically  at  Prime-times(48),  which  is  { (2  2  2  2  3) }. 

4  conjectures  proposed,  after  .6  seconds.  t 

e.g.,  "x  is  never  inside  any  member  of  Prime-times{x)"  F 

Testing  them  on  other  examples  of  Prime-times.  r 

j. 

Oniy  1  out  of  the  4  conjectures  are  verified  for  all  23  known  examples  of  Prime-times: 


Conjecture  1:  Prime-times(x)  is  always  a  singleton  set.  It 

That  is,  Prime-times  is  a  function,  not  just  a  relation.  | 

e.g.,  Prime-times(48),  which  is  { (2  2  2  2  3) },  is  a  singleton  set.  r 

e.g.,  Prime-times(47),  which  is  { (47) },  is  a  singleton  set.  K 

This  holds  for  all  17  known  examples  of  Prime-times.  (Chance  of  coincidence  is  ,0001)  L' 

This  fails  for  2  of  the  boundary  cases  (extreme  numbers):  0  and  1. 

Conjecture  is  amended:  Each  number  >1  is  the  product  of  a  unique  bag  of  primes. 

I  suspect  that  this  conjecture  may  be  very  useful.®  'i 

1 

*1^ 

USER:  Call  this  conjecture  "Unique  factorization  conjecture". 


f 


n" 


To  shorn  that  AM  isn’t  really  always  right  on  the  mark, 
the  next  sequence  of  tasks  includes  a  crime  of  omission 
(ignoring  the  concept  of  Partitions)  and  a  false  start 
(worrying  about  numbers  which  can  be  represented  as  the 
sum  of  two  primes  in  precisely  one  way).  Notice  the 
skip  here;  S  tasks  have  been  omitted. 


®  How  did  AM  know  Ihit?  0n«  of  1h«  (unfar<un*t«ly  f«w<)  m«t*-hturi»tic»  in  AM  «iid  tho  followinf:  ”Whtn  mine  tho  ’lOOK 
at  the  inverse  of  extreme  items  under  the  operation  f  rui«,  TKk  th«  foliowinf  not*  onto  th« 
lnt«ro«t  fKot  of  tho  now  eencopt  which  w  erottod’  'Conjectures  involving  this  concept  and  f  (or 
f'*)  are  natural,  interesting,  and  probably  useful.’  *®''**p*  PRIMES  was  dofinod 

vsint  tho  ‘oxtroms’  heuristic  rule,  with  f^Divisors-of.  Whon  PRIMES  was  first  croalod,  tho  moto-rulo  wo  just 
prosontod  tKkod  tho  foliowinf  noto  onto  Primos.intorost  •Conjectures  involving  Primes  and 

division  (or  multiplication)  are  natural,  interesting,  and  probably  useful.’ 

uniquo  fMtoriittion  conjocturo  triffort  this  footuro,  whorots  Goldboch't  eonjoeturo  wouldn't. 
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**  Twk  84s  *» 

Examining  ADO'^x),  looking  for  patterns  involving  its  values. 

2  Reasons:  (1)  ADD"'  is  analogous  to  the  newly-interesting  concept  "TIMES"'". 

(2)  Many  examples  of  ADD"'  are  known,  to  induce  from. 

Looking  specifically  at  ADD"'(6),  which  is  { (1  1  1  1  1  1)  (1  1  i  1  2)  (1  1  1  3)  (1  1  2  2) 

(1  14X1  2  3)  (1  5)  (2  2  2)  (2  4)  (3  3)  (6)). 

17  conjectures  proposed,  after  3.9  seconds, 
e.g.,  "ADD"'(x)  always  contains  a  bag  of  primes". 

Testing  them  on  other  examples  of  ADD"'. 

Only  11  out  of  the  17  conjectures  are  verified  for  all  19  known  examples  of  ADD*': 

3  out  of  the  1 1  conjectures  were  false  until  amended. 

Conjecture  1:  ADD*'(x)  never  contains  a  singleton  bag. 

Conjecture  2:  ADD"'(x)  always  contains  a  bag  of  size  2  (also  called  a  "pair"  or  a  "doubleton"), 
e.g.,  ADD" '(6)  contains  (1  5),  (2  4),  and  (3  3). 
e.g.,  ADD*'(4)  contains  (1  3),  and  (2  2). 

Creating  a  new  concept,  "Pair-add". 

Pair-add  is  a  relation  from  Numbers  to  Pairs-of-numbers. 

Pair-add(x)  is  all  bags  in  ADD*'(x)  which  are  doubletons  (i.e.,  of  size  2). 
e.g.,  Pair-add(12)-{  (1  11)  (2  10)  (3  9)  (4  8)  (5  7)  (6  6) }. 
e.g.,  Pair-add(4)«{  (1  3)  (2  2) }. 

Conjecture  3:  ADD*'(x)  always  contains  a  bag  containing  only  Ts. 


Conjecture  10:  ADD''(x)  always  contains  a  pair  of  primes. 

This  conjecture  is  false.  Conjecture  is  amended: 

"ADD"'(x)  usually  (but  not  always)  contains  a  pair  of  primes." 

e.g.,  ADD"'(10)  contains  (3  7),  and  (5  5). 

e.g.,  ADD" '(4)  contains  (2  2). 

e.g.,  ADD" '(11)  does  not  contain  a  pair  of  primes. 

Creating  a  new  concept,  "Prime-add". 

Prime-add  is  a  relation  from  Numbers  to  Pairs-of-numbers. 
Prime-add(x)  is  all  bags  in  AD0"'(x)  which  are  pairs  of  primes, 
e.g.,  Prime-add(12)»{  (5  7) }. 
e.g.,  Prime-add(10)-{  (3  7)  (5  5) }. 
e.g.,  Prime-add(ll)  "  {  } 


^  r-Tt 
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*♦  Task  106;  ** 

Considering  the  set  of  numbers  for  which  Prime -add  is  defined  (has  non-empty  value). 

1  Reason:  (1)  Prime-add  often  has  non-empty  value.  Worth  isolating  that  case. 

Warning;  no  task  on  the  agenda  has  an  interestingness  value  above  200!!! 

Creating  a  new  concept  "Prime-addable". 

Prime-addable  is  a  kind  of  Number,  x  is  Prime-addable  if  Prime-add(x)  is  non-empty. 

Will  spend  5.0  seconds  filling  in  examples  of  Prime-addable. 

18  examples  found.  Here  are  some  of  them:  4  5  6  7  8  9  10  12  13  14  17  16  18. 
Empirically,  all  even  numbers  are  also  in  this  set  (ignoring  boundary  cases:  0  2  4  6). 

So  conjectured.  Danger:  must  examine  boundary  cases:  the  numbers  0,  2,  4,  and  6. 

Two  exceptions  noticed.  The  only  exceptions  are  the  smallest  boundary  cases:  0,  2. 
Conjecture  is  amended:  All  even  numbers  >2  are  the  sum  of  two  primes. 

Warning:  i  expect  this  conjecture  wili  be  cute  but  useless. 


USER:  Why? 


Because  the  concept  of  "Primes"  originally  arose  as  extreme  cases  of  Divisors-of.  The  more 
closely  an  operation  X  is  related  to  the  concept  Divisors-of,  the  more  naturai  wili  be  any 
conjecture  involving  both  that  operation  X  and  Primes.  E.g.,  conjectures  involving  both 
Primes  and  Times  will  be  natural  and  useful.  But  this  conjecture,  which  involves  Primes  and 

ADD'*,  will  be  cute  but  useless,  since  the  relation  "ADD*'"  is  unrelated  to  the  relation 
"Divisors-of". 

End  of  explanation. 


USER:  Call  this  conjecture  "Goldbach's  conjecture”. 


*»  Task  107;  *♦ 

Considering  the  set  of  numbers  for  which  the  relation  Prime-add  is  single-valued. 

3  Reasons:  (1)  Prime-add  often  has  singleton  value.  Worth  isolating  that  case. 

(2)  Restricted  to  this  set.  Prime-add  would  be  a  function. 

(3)  Focus  of  attention:  AM  recently  worked  on  Prime-add. 

Creating  a  new  concept  "Uniquely-prime-addable". 

"Uniquely-prime-addable"  is  a  kind  of  Number. 

X  is  Uniquely-prime-addable  if  Prime-add(x)  is  a  singleton. 

Will  spend  10.0  seconds  filling  in  examples  of  Uniquely-prime-addable. 

1 1  examples  found.  Here  are  some  of  them:  4  5  7  8  9  12  13. 

No  obvious  conjecture  derived  empirically. 

Will  forget  "Uniquely-prime-addable  numbers",  if  no  Ties  found  in  near  future. 


1%: 
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2.5.  Recapping  the  Example 

Let’s  once  again  eavesdrop  on  a  mathematician,  as  he  describes  to  a  colleague  what  AM 
did. 

This  example  was  preceded  by  the  momentous  discoveries  of  multiplication  and  division. 
Several  interesting  properites  of  these  operations  were  noticed.  The  first  task  which  was 
illustrated  (*»  Task  65  *♦)  involves  exploring  the  concept  of  "divisors  of  a  number" 
(meaning  all  positive  integers  which  divide  evenly  into  the  given  number).  After  tiring  of 
finding  examples  of  this  relation,  AM  investigates  extreme  cases;  that  is,  it  wonders  which 
numbers  have  very  few  or  very  many  divisors. 

AM  thus  discovers  Primes  in  a  curious  way.  Numbers  with  0  or  1  divisor  are  essentially 
nonexistent,  so  they’re  not  found  to  be  interesting.  AM  notices  that  numbers  with  3  divisors 
always  seem  to  be  squares  of  numbers  with  2  divisors  (primes).  This  raises  the 
interestingncsj  of  several  concepts,  including  primes.  Soon  (*♦  TASK  79  ♦*),  another 
conjecture  involving  primes  is  noticed:  Many  numbers  seem  to  factor  into  primes.  This 
causes  a  new  relation  to  be  defined,  which  associates  to  a  number  x,  all  prime  factorizations 
of  X.  The  first  question  AM  asks  about  this  relation  is  "is  it  a  function?".  This  question  is 
the  full  statement  of  the  unique  factorization  conjecture:  the  fundamental  theorem  of 
arithmetic.  AM  recognized  the  value  of  this  relationship,  and  assigned  it  a  high 
interestingness  rating. 

In  a  similar  manner,  though  with  lower  hopes,  it  noticed  some  more  relationships  involving 
primes,  including  Goldbach’s  conjecture.  AM  quite  correctly  predicted  that  this  would  turn 
out  to  be  cute  but  of  no  future  use  mathematically. 

The  last  activity  mentioned  (*«  TASK  107  *«)  shows  AM  examining  a  rather  nonstandard 
concept:  "numbers  which  can  be  written  as  the  sum  of  a  pair  of  primes,  in  only  one  way". 
These  are  termed  "uniquely-prime-addable"  numbers.  It  was  mildly  unfortunate  that  AM 
gave  up  on  this  concept  before  noticing  that  is  uniquely-prime-addable,  for  any  prime 
number  p,  and  that  in  fact  these  are  the  only  odd  uniquely-prime-addable  numbers.  The 
session  was  repeated  once,  with  a  human  user  telling  AM  explicitly  to  continue  studying  this 
concept.  AM  did  in  fact  construct  "Uniquely-prime-addable-odd-numbers",  and  then  notice 
this  relationship.  Here  we  see  an  example  of  unstable  equilibrium:  if  pushed  slightly  this 
way,  AM  will  get  very  interested  and  spend  a  lot  of  time  working  on  this  kind  of  number. 
Since  it  doesn’t  have  all  the  sophistication  (i.e.,  compiled  hindsight)  that  we  have,  it  can’t 
know  instantly  whether  what  it’s  doing  will  be  fruitless. 
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' Objectively’  given,  ’important’  problems  may  arise  pin  mathj.  But  even  then  the 
mathematician  is  essentially  free  to  take  it  or  leave  it  and  turn  to  something  else, 
while  an  ’important’  problem  in  (any  other  science]  is  usually  a  conflict,  a 
contradiction,  which  ’must’  be  resolved.  The  mathematician  has  a  wide  choice  of 
which  way  to  turn,  and  he  enjoys  a  very  considerable  freedom  in  what  he  does. 

—  von  Noumonn 


AM  is  one  of  those  awkward  programs  whose  representations  only  make  sense  if  you 
already  understand  how  they  will  be  operated  on.  A  discussion  of  AM’s  control  structure 
(this  chapter  and  the  next)  must  thus  precede  a  discussion  of  concepts  and  how  they  are 
represented  (Chapter  5).  Section  2.1  gave  the  reader  a  sufficient  knowledge  of  AM’s 
"anatomy"  to  follow  these  chapters.  Thus  armed  with  a  cursory  knowledge  of  the  "statics"  of 
AM,  we  shall  proceed  to  describe  in  detail  Its  "dynamics". 

Section  3.1  will  give  the  reader  a  feeling  for  the  immensity  of  AM’s  search  space.  This  is 
the  "problem".  The  next  section  will  give  the  top-level  "solution":  the  flow  of  control  is 
governed  by  a  job-list,  an  agenda  of  plausible  tasks.  Section  3.3  will  present  some  details  of 
this  global  control  scheme. 

Chapter  4  deals  with  the  way  AM’s  heuristics  operate;  this  could  be  viewed  as  the  "low- 
level"  or  local  control  structure  of  AM.  Chapter  5  contains  some  detailed  information 
about  the  actual  concepts  (and  heuristics)  AM  starts  with,  and  a  little  more  about  their 
design  and  representation.  The  reader  is  also  directed  to  Appendix  5,  which  presents 
several  detailed  examples  of  AM  "in  action". 


3.1.  AM*8  Search 


To  develop  mathematics,  one  must  always  labor  to  substitute  ideas  for  calculations. 

—  Diriehlet 


Let’s  first  spend  a  paragraph  reviewing  how  concepts  are  stored.  AM  contains  a  collection 


J  .V  ir 
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of  data  structures,  called  concepts.  Each  concept  is  meant  to  coincide  intuitively  with  one 
mathematical  idea  (e.g.,  Sets,  Union,  Trichotomy).  As  such,  a  concept  has  several  aspects  or 
parts,  called  facets  (e.g..  Examples,  Definitions,  Domain/range,  Worth).  If  you  wish  to  think 
of  a  concept  as  a  "frame",  then  its  facets  are  "slots"  to  be  filled  in.  Each  facet  of  a  concept 
will  either  be  totally  blank,  or  else  will  contain  a  bunch  of  entries.  For  example,  the 
Algorithms  facet  of  the  concept  Union  may  point  to  several  equivalent  LISP  functions,  each 
of  which  can  be  used  to  form  the  union  of  two  sets'.  Even  the  "heuristic  rules"  are  merely 
entries  on  the  appropriate  kind  of  facet  (e.g.,  the  entries  on  the  Interest  facet  of  the 
Structure  concept  are  rules  for  judging  the  Interestingness  of  Structures^). 

At  any  moment,  AM  contains  a  couple  hundred  concepts,  each  of  which  has  only  some  of  its 
facets  filled  in.  AM  starts  with  115  concepts,  and  grows  to  about  300  concepts  before 
running  out  of  time/space.  Most  facets  of  most  concepts  are  totally  blank.  AM’s  basic 
activity  is  to  select  some  facet  of  some  concept,  and  then  try  to  fill  in  some  entries  for  that 
slot®.  Thus  the  primitive  kind  of  "task"  for  AM  is  to  deal  with  a  particular  facet/concept 
pair.  A  typical  task  looks  like  this: 

Check  the  enirias  on  the  "Oomtin/rans#”  facet  of  the  "Bas-lnaert"  concept 

If  the  average  concept  has  ten  or  twenty  blank  facets,  and  there  are  a  couple  hundred 
concepts,  then  clearly  there  will  be  about  20x200-4000  "fill-in"  type  tasks  for  AM  to  work 
on,  at  any  given  moment.  If  several  hundred  facets  have  recently  been  filled  in,  there  will 
be  that  many  "check-entries"  type  tasks  available.  Executing  a  task  happens  to  take  around 
ten  or  twenty  cpu  seconds,  so  over  the  course  of  a  few  hours  only  a  small  percentage  of  these 
tasks  can  ever  be  executed.^ 

Since  most  of  these  tasks  will  never  be  explored,  what  will  make  AM  appear  smart  -  or 
stupid  -  are  its  choices  of  which  task  to  pick  at  each  moment.®  So  it’s  worth  AM’s  spending 
a  nontrivial  amount  of  time  deciding  which  task  to  execute  next.  On  the  other  hand,  it  had 
better  not  be  too  much  time,  since  a  task  does  take  only  a  dozen  seconds.® 

One  question  that  mijst  u-  answered  is:  What  percentage  of  AM’s  legal  moves  (at  any 


Th«  rttterw  for  hiving  multipli  ilgorithm*  it  thit  lomitiinii  AM  will  win!  on*  thit  i»  fitt,  lomitimii  AM  will  bo  moro 
concirmd  with  oconomizing  on  itorigi,  lomotimoi  AM  will  wint  to  'inilyzo"  in  ilgorithm,  ind  for  thit 
purpoii  h  mull  bi  i  viry  un-oplimizid  function,  otc. 

2 

A  typicil  such  rulo  ii:  *A  ..ructuri  it  viry  intiritting  if  ill  Hi  iloininti  in  mildly  intiriiting  in  priciiily  tho  iimo  wiy." 

®  Thit  ii  not  quiti  eomphti  In  idditwn  to  filling  in  intrioi  for  i  givin  ficit/concipt  piir,  AM  miy  wiih  to  chock  it,  iplit  it 
up,  riorginizi  it,  itc 

q 

Tho  procho  *18  ticondt  ivirigi"  figuri  it  not  importint.  All  hturittk-ttirch  progrimt  tuffor  thit  ttmo  hindieip:  At  tho 
dopth  to  which  thiy'vi  tiirchtd  incriittt,  tho  pircintigi  of  nodit  (it  or  ibovi  thit  lovil)  which  hivo 
btin  oximinod  dicfiitit  ixpontntiilly  (ittuming  tho  brinching  fictor  b  it  ttrktly  lirgir  thin  unity). 

®  Thit  it  truo  of  ill  hourittic  tiireh  progrimt  Tho  brinchior  tho  tiirch,  thi  moro  it  ippliot. 

®  Tho  intwor  it  thit  AM  tpondi  thit  'dociding’  timi  not  kmt  boforo  i  titk  it  picked,  but  rithor  iich  timo  i  titk  it  iddod 
to  tho  igindi  A  littli  undor  I  cpu  ticond  it  tpont,  on  tho  ivirigi,  to  piKO  tho  titk  proporly  on  tho 
igtnda,  to  ittign  it  I  mtiningfu!  numirie  priority  viluo.  So  "letion  timo"  i*  roughly  one  order  of  mignitudo 
lirgir  thin  'deciding  time*. 


Chtptar  3 


AM:  Oitcoviry  m  Math«m>tic«  at  Hauriatic  Saarch 


-30- 


typical  moment)  would  be  considered  intelligent  choices,  and  what  percentage  would  be 
irrational?  The  answer  comes  from  empirical  results.  The  percentages  vary  wildly 
depending  on  the  previous  few  tasks.  Sometimes,  AM  will  be  obviously  "in  the  middle"  of 
a  sequence  of  tasks,  and  only  one  or  two  of  the  legal  tasks  would  seem  plausible.  Other 
times,  AM  has  just  completed  an  investigation  by  running  into  dead-ends,  and  there  may  be 
hundreds  of  tasks  it  could  choose  and  not  be  criticized.  The  median  case  would  perhaps 
permit  about  6  of  the  legal  tasks  to  be  judged  reasonable. 

It  is  important  for  AM  to  locate  one  of  these  currently-plausible  tasks,  but  it’s  not  worth 
spending  much  time  deciding  which  of  them  to  work  on  next.  AM  stilt  faces  a  huge  search: 
find  one  of  the  6  winners  out  of  a  few  thousand  candidates. 

Its  choice  of  tasks  is  made  even  more  important  due  to  the  10-second  "cycle  time"  -  the  time 
to  Investigate/execute  one  task.  A  human  user  is  watching,  and  ten  seconds  is  a  nontrivial 
amount  of  time  to  him.  He  can  therefore  observe,  perceive,  and  analyze  each  and  every 
task  that  AM  selects.  Even  just  a  few  bizarre  choices  will  greatly  lower  his  opinion  of  AM’s 
intelligence.  The  trace  of  AM’s  actions  is  what  counts,  not  its  final  results.  So  AM  can’t 
draw  much  of  its  apparent  intelligence  from  the  speed  of  the  computer. 

Chess-playing  programs  have  had  to  face  the  dilemma  of  the  trade-off  between  "intelligence" 
(foresight,  inference,  processing,...)  and  total  number  of  board  situations  examined.  In  chess, 
the  characteristics  of  current-day  machines,  language  power  vs.  speed,  and  (to  some  extent) 
the  limitations  of  our  understanding  of  how  to  be  sophisticated,  have  to  date  unfortunately 
still  favored  fast,  nearly-blind^  search.  Although  machine  speed  and  LISP  slowness  may 
allow  blind  search  to  win  over  symbolic  Inference  for  shallow  searches,  it  can’t  provide  any 
more  than  a  constant  speed-up  factor  for  an  exponential  search.  Inference  is  slowly  gaining 
on  brute  force,*  and  must  someday  triumph. 

Since  the  number  of  "legal  moves"  for  AM  at  any  moment  is  in  the  thousands,  it  is 
unrealistic  to  consider  "systematically"®  walking  through  the  entire  space  that  AM  can  reach. 
In  AM’s  problem  domain,  there  is  so  much  "freedom"  that  symbolic  inference  finally  can 
win  over  the  "simple  but  fast"  exploration  strategy’®. 


I*,  u*in(  a  vary  aimpi*  atatic  tvalualian  function, 
c 

E  f ,  c*«  [Barhnar  7A\  Thara,  aairchinf  ta  uaad  mainly  to  vtrify  plauaibla  movaa  (a  convar(tnt  procaaa),  not  to  diacovor 
tham  (a  buahiar  aaarch). 

9 

Of,  axhauativaty,  or  uainf  vLfi  minimaxint,  ate 

Thia  ia  tha  author’a  opinio^  partially  aupportad  by  tha  raaulta  of  AM  Paul  Cohan  diaagraaa,  faalini  that  machina  apaad 
ahould  ba  tha  kay  to  an  automatad  mathamalician’a  auccaaa 
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There  exist  too  many  combinations  to  consider  all  combinations  of  existing  entitles; 
the  creative  mind  must  only  propose  those  of  potential  Interest. 

Poincar** 


A  great  deal  of  heuristic  knowledge  is  required  to  constrain  the  necessary  processing 
effectively,  to  zero  in  on  a  good  task  to  tackle  next.  This  is  done  in  two  stages. 

1.  A  list  of  plausible  facet/concept  pairs  is  maintained.  Nothing  can  get  onto  this  list 

unless  there  is  some  reason  why  Ailing  in  (or  checking)  that  facet  of  that  concept 
would  be  worthwhile. 

2.  Ail  the  plausible  tasks  on  this  "job  list"  are  ranked  by  the  number  and  strength  of 

the  different  reasons  supporting  them.  Thus  the  facet/concept  pairs  near  the  top  of 
the  list  will  all  be  very  promising  tasks  to  work  on. 

The  Arst  of  these  constraints  is  akin  to  replacing  a  legal  move  generator  by  a  plausible 
move  generator.  The  second  kind  of  constraint  is  akin  to  using  a  heuristic  evaluation 
function  to  select  the  best  move  from  among  the  plausible  ones.' ' 

The  job-list  or  agenda  is  a  data  structure  which  is  a  natural  way  to  store  the  results  of  these 
procedures.  It  is  (1)  a  list  of  all  the  plausible  tasks  which  have  been  generated,  and  (2)  it  is 
kept  ordered  by  the  numeric  estimate  of  how  worthwhile  each  task  is.  A  typical  entry  on 
the  agenda  might  look  like  this: 


It 


PMt  Al  prof  rim*  (of,  [StmutI  67})  havt  indkitod  th«t  cenotroininf  lontration  (I)  i«  moro  imporUnt  than  oophiolicatod 
ordorinf  of  tho  rotultant  cindiditt*  (2).  Thi*  wm  confirr^  by  tho  oxporimont*  porformod  on  AM. 
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The  actual  top-level  control  structure  is  simply  to  pluck  the  top  task  from  the  agenda  and 
execute  it.  That  is,  select  the  facet/concept  pair  having  the  best  supporting  reasons,  and  try 
to  fill  in  that  facet  of  that  concept. 

While  a  task  is  being  executed,  some  new  tasks  might  get  proposed  and  merged  into  the 
agenda.  Also,  some  new  concepts  might  get  created,  and  this,  too,  would  generate  a  flurry  of 
new  tasks. 

After  AM  stops  filling  in  entries  for  the  facet  specified  in  the  chosen  task,  it  removes  that 
task  from  the  agenda,  and  moves  on  to  work  on  whichever  task  is  the  highest-rated  at  that 
time. 

The  reader  probably  has  a  dozen  good  questions  in  mind  at  this  point  (e.g..  How  do  the 
reasons  get  rated?.  How  do  the  tasks  get  proposed?.  What  happens  after  a  task  is 
selected?,...).  The  next  section  should  answer  most  of  these.  Some  more  judgmental  ones 
(How  dare  you  propose  a  numeric  calculus  of  plausible  reasoning?!  If  you  slightly  de-tune 
all  those  numbers,  does  the  system’s  performance  fall  apart?...)  will  be  answered  in  Chapter 
7. 


Creative  energy  U  used  mainly  to  ask  the  right  question. 

~  Halmot 


3.11.  Why  an  Agenda? 


This  subsection  provides  motivation  for  the  following  one,  by  arguing  that  a  job-list  scheme 
is  a  natural  mechanism  to  use  to  manage  the  task-selection  problem  AM  faces.  If  that  seems 
obvious  to  you,  feel  free  to  skip  ahead  to  section  3.3.-,?,  page  33. 


Recall  that  AM  must  zero  in  on  one  of  the  best  few  tasks  to  perform  next,  and  it  repeatedly 
makes  this  choice.  At  each  moment,  there  might  be  thousands  of  directions  to  explore 
(plausible  tasks  to  consider). 


If  all  the  legal  tasks  were  written  out,  and  reasons  were  thought  up  to  support  each  one, 
then  perhaps  we  could  order  them  by  the  strength  of  those  reasons,  and  thereby  settle  on 
the  "best"  task  to  work  on  next.  In  order  to  appear  "smart"  to  the  human  user,  AM  should 
never  execute  a  task  having  no  reasons  attached. 


Some  magical  function  will  be  assumed  to  exist,  which  provides  a  numeric  rating,  a  priority 
value,  for  any  given  task.  The  function  looks  at  a  given  facet/concept  pair,  examines  all  the 
associated  reasons  supporting  that  task,  and  computes  an  estimate  of  how  worthwhile  it 
would  be  for  AM  to  spend  some  time  now  working  on  that  facet  of  that  concept. 


v:  3-T=t^*Ty  >c'cxrric^-7^'i^^*7pir:Kn?rv"JWJ7rrKrf"7P^!jc^j^7T?CT.'?A7^^:^'KT^rE^is 
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So  AM  will  maintain  a  list  of  those  legal  tasks  which  have  some  good  reasons  tacked  onto 
them,  which  justify  wh'  each  task  should  be  executed,  why  it  is  plausible.  At  least 
implicitly,  AM  has  a  numeric  rating  for  each  task.  The  obvious  control  algorithm  is  to 
choose  the  task  v  ith  the  highest  rating,  and  work  on  that  one  next. 

Assuming  the  tasks  on  this  list  are  kept  ordered  by  this  numeric  rating,  then  AM  can  Just 
repeatedly  pluck  the  highest  task  and  execute  it.  While  it’s  executing,  some  new  tasks  might 
get  proposed  and  added  to  the  list  of  tasks.  Reasons  are  kept  tacked  onto  each  task  on  this 
list,  and  form  the  basis  for  the  numeric  priority  rating. 

Give  or  take  a  few  features,  this  notion  of  a  "job-list"  is  the  one  which  AM  uses.  It  is  also 
called  an  agenda.'^  "A  task  on  the  agenda"  is  the  same  as  "a  job  on  the  job-list"  is  the  same 
as  "a  facet/concept  pair  which  has  been  proposed"  Is  the  same  as  "an  active  node  In  the 
search  space".  Henceforth,  I’ll  use  the  following  all  interchangeably:  task,  facet/concept  pal*-, 
node,  job.  This  should  break  up  the  monotony'®. 


w 


p. 


i.'* 


The  flavor  of  agenda-list  used  here  is  similar  to  the  control  structure  of  HEARSAY-II 
tLesser/Fennell/Erman/Reddy  75j.  Vast  numbers  of  tasks  are  proposed  and  added  to  the 
job-list.  Occasionally,  when  some  new  data  arrives,  some  task  is  repositioned 


8.S.2.  Details  of  the  Agenda  scheme 

At  each  moment,  AM  has  many  plausible  tasks  (hundreds  or  even  thousands)  which  have 
been  suggested  for  some  good  reason  or  other,  but  haven’t  been  carried  out  yet.  Each  task 
is  at  the  level  of  working  on  a  certain  facet  of  a  certain  concept:  filling  it  in,  checking  it,  etc. 
Recall  that  each  task  also  has  tacked  onto  it  a  list  of  symbolic  reasons  explaining  why  the 
task  is  worth  doing. 

Ir.  addition,  a  number  (between  0  and  1000)  is  attached  to  each  reason,  representing  some 
absolute  measure  of  the  value  of  that  reason  (at  the  moment).  One  global  formula'^ 
combines  all  the  reasons’  values  into  a  single  priority  value  for  the  task  as  a  whole.  This 
overall  rating  is  taken  to  indicate  how  worthwhile  it  would  be  for  AM  to  bother  executing 
that  task,  how  interesting  the  task  would  probably  turn  out  to  be.  The  "intelligence"  of 
AM’s  selection  of  task  is  thus  seen  to  depend  on  this  one  formula.  Yet  experiments  show 
that  its  precise  form  is  not  important.  We  conclude  that  the  "intelligence"  has  been  pushed 
down  into  the  careful  assigning  of  reasons  (and  their  values)  for  each  proposed  task. 

1 9 

Borrowtd  from  Kaplan’*  farm  for  th*  job-list  prasant  in  KRl  (aaa  [Bobrow  &  Winograd  77]).  Tor  an  tarliar  ganarat 
discussion  of  agandas,  saa  [Knuth  68]. 

13 

and  covar  my  aloppinass.  Ssriousty,  thanks  to  English,  *Kh  of  thass  tarms  will  conjura  up  a  slightly  diffsrant  imaga;  a 
"job"  is  somsthing  to  do,  a  "nods"  is  an  itam  in  a  saarch  spacs,  "facat/concapt  pair"  raminda  you  of  the 
format  of  a  task 

Haro  ia  that  formula;  Worth(J)  ■  ||S0RT(SUM  R  ®)||  x  (  O.ZxWortWA)  ♦  0.3xWorth(F)  ♦  0.5xWorth(C)],  whara  J  •  job  to  ba 
judgad  •  (Act  A,  Facat  F,  Concapt  C),  and  (Rj)  ara  the  rating*  of  th*  raason*  supporting  J.  For  th*  aampi* 
job  picturad  in  th*  box  balow,  A«Fillin,  F'Exsmplas,  C>S*t*,  (Rj]«{  100,100,200}.  Th*  formula  will  be 
rspsatad  —  and  axpisinad  --  in  Sactior.  4.2,  on  page  40. 


& 
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A  typical  entry  on  the  agenda  might  look  like  this; 

TASK:  Fill-in  •xamplat  of  Snit 
PRIORITY:  300 
REASONS: 

100:  No  known  oxamplas  for  Soft  to  far. 

100:  Failad  io  fillin  axamplat  of  Sal-union,  for  lack  of  axamplot  of  Salt 
200:  Focus  of  attantion:  AM  raeantly  workad  on  tha  concapt  of  Sal-union 


Notice  the  similarity  of  this  to  the  initial  few  tines  which  AM  types  Just  after  it  selects  a  job 
to  work  on. 

The  flow  of  control  is  simple:  AM  picks  the  task  with  the  highest  priority  value,  and  tries  to 
execute  it.  As  a  side  effect,  new  jobs  occasionally  get  added  to  the  agenda  while  the  task  is 
being  executed. 

The  global  priority  value  of  the  task  also  indicates  how  much  time  and  space  this  task 
deserves.  The  sample  task  above  might  rale  20  cpu  seconds,  and  200  list  cells.  When  either 
of  these  resources  is  used  up,  AM  terminates  work  on  the  task,  and  proceeds  to  pick  a  new 
one.  These  two  limits  will  be  referred  to  in  the  sequel  as  "timelspace  ifj-anta"  which  are 
allocated  to  the  chosen  task.  Whenever  several  techniques  exist  for  satisfying  some  task,  the 
remaining  time/space  quanta  arc  divided  evenly  among  those  alternatives;  i.e.,  each  method 
is  tried  for  a  small  time.  This  policy  of  parceling  out  time  and  space  quanta  is  called 
"activation  energy"  in  [Hewitt  76]  and  called  "resource-limited  processes"  in  [Norman  & 
Bobrow  75].  In  the  case  of  filling  in  examples  of  sets,  the  space  quantum  (200  cells)  will  be 
used  up  quickly  (long  before  the  20  seconds  expire). 

There  are  two  big  questions  now: 

1.  Exactly  how  is  a  task  proposed  and  ranked? 

How  is  a  plausible  new  task  first  formulated? 

How  do  the  supporting  reasons  for  the  task  get  assigned? 

How  does  each  reason  get  assigned  an  absolute  numeric  rating? 

Does  a  task’s  priority  value  change?  When  and  how? 

2.  How  does  AM  execute  a  task,  once  it’s  cliosen? 

Exactly  what  can  bf  done  during  a  task’s  execution? 

The  next  chapter  will  deal  with  both  of  these  questions.  A  detailed  discussion  of  difficulties 
and  limitations  of  these  ideas  can  be  found  in  Section  7.2,  on  page  156. 
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Chapter  4.  Heuristic  Rules 


Assume  that  somehow  AM  has  selected  a  particular  task  from  the  agenda  -  say  "Fill-in 
Exampitt  of  Primaa".  What  precisely  does  AM  do,  in  order  to  execute  the  task?  How  ere 
examples  of  primes  filled  In? 

The  answer  can  be  compactly  stated  as  follows: 

"AM  selects  relevant  heuristics, and  executes  them" 

This  really  just  splits  our  original  question  into  two  new  ones:  (i)  How  are  the  relevant 
heuristics  selected,  and  (ii)  What  does  it  mean  for  heuristics  to  be  executed  (e.g.,  how  does 
executing  a  heuristic  rule  help  to  fill  in  examples  of  primes?). 

These  two  topics  (In  reverse  order)  are  the  two  major  subjects  of  this  chapter.  Although 
several  examples  of  heuristics  will  be  given,  the  complete  list  is  relegated  to  Appendix  3.  ‘ 

The  first  section  explains  what  heuristic  rules  look  like  (their  "syntax",  as  It  were).  The  next 
three  sections  illustrate  how  they  can  be  executed  to  achieve  their  desired  results  (their 
"semantics"). 

Section  4.5  explains  where  the  rules  are  stored  and  how  they  are  accessed  at  the  appropriate 
times. 

Finally,  the  Initial  body  of  heuristics  1$  analyzed.  The  informal  know, edge  they  contain  is 
categorized  and  described.  Unintentionally,  the  distribution  of  heuristics  among  the 
concepts  is  quite  nonhomogeneous;  this  too  is  described  in  Section  4.6. 


4.1.  Syntax  of  the  Heuristics 

Let’s  start  by  seeing  what  a  heuristic  rule  looks  like.  In  general  (see  [Davis  &  King  75]  for 
historical  references  to  production  rules),  it  will  have  the  form 

If  <situational  fiuent> 

Than  <actims> 

As  an  illustration,  here  is  a  heuristic  rule,  relevant  when  checking  examples  of  anything: 


I 


Thtr*  tiMy  art  eondantad  and  phratad  in  Enf  Irth.  Tha  raadar  withinf  to  aaa  axamplaa  of  tha  hauriatka  aa  thay  actually 
wara  codad  in  LISP  ahould  f  lanca  at  Appandix  2.3. 
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lf  th«  current  task  is  to  Check  Examples  of  any  concept  X, 
and  (Forsome  Y)  Y  is  a  cenoralixation  of  X, 
and  Y  has  at  ievtst  10  examples, 
and  all  examples  ot  Y  are  also  examples  of  X, 

Then  print  the  following  conjecture:  X  is  really  no  more  specialized  than  Y, 
and  add  it  to  the  Examples  facet  of  the  concept  named  "Conjectures", 
and  add  the  following  task  to  the  agenda:  "Check  examples  of  Y”,  for  the  reason:  "Just 
as  Y  was  no  more  general  than  X,  one*of  Ganeralizations(Y}  may  turn  out  to 
be  no  more  general  than  Y”,  with  a  rating  for  that  reason  computed  as  the 
average  of:  |jExamples(Generalizations(Y))||,  ||Examples(Y}||,  and 
Priority(Current  task). 

As  with  production  rules,  and  fc  'al  grammatical  rules,  e'.h  of  AM’s  heuristic  rules  has  a 
left-hand-side  and  a  right-hand-side.  On  the  left  is  a  test  to  see  whether  the  rule  is 
applicable,  and  on  the  right  is  a  list  of  actions  to  take  if  the  rule  applies.  The  left-hand-side 
will  also  be  called  the  IF-part,  the  predicate,  the  preconditions,  left  side,  or  the  situational 
fluent  of  the  rule.  The  right-hand-side  will  sometimes  be  referred  to  as  the  THEN-part,  the 
response,  the  right  side,  or  the  actions  part  of  the  rule. 


4.1.1.  Syntax  of  the  Left-hand  Side 

The  situational  fluent  is  a  LISP  predicate,  a  function  which  always  returns  True  or  False 
(in  LISP,  it  actually  returns  either  the  atom  T  or  the  atom  NIL).  This  predicate  may 
investigate  facets  of  any  concept  (often  merely  to  see  whether  they  are  empty  or  not),  use  the 
results  of  recent  tests  and  behaviors  (e.g.,  to  see  how  much  epu  time  AM  spent  trying  to 
work  on  a  certain  task),  etc. 

The  left  side  is  a  conjunction  of  the  form  PI  a  P2  a...  All  the  conjuncts,  except  the  very 
first  one,  are  arbitrary  LISP  predicates.  They  are  only  constrained  to  obey  two 
commandments: 

1.  Be  quick!  (return  either  True  or  False  in  under  0.1  epu  seconds) 

2.  Have  no  side  effects!  (destroying  or  creating  list  structures  or  Lisp  functions,  resetting 

variables) 

Here  are  some  sample  conjuncts  that  might  appear  inside  a  left-hand  side  (but  not  as  the 
very  first  conjunct): 

•  More  than  half  of  the  current  task's  time  quantum  is  already  exhausted,... 

•  There  are  some  known  examples  of  Structures,... 

•  Some  generalization  of  the  current  concept  (the  concept  mentioned  as  part  of  the 
current  task)  has  an  empty  Examples  facets.. 

•  The  space  quantum  of  the  current  task  is  gone,  but  its  time  allocation  is  less  than  lOX 

used  up,... 

•  A  task  recently  selected  had  the  form  "Restructure  facet  F  of  concept  X”,  where  F  is 
any  facet,  and  X  is  the  current  concept^- 
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•  The  Mser  has  used  this  system  at  least  once  before^.. 

•  It’s  Tuesday,... 

The  very  first  conjunct  of  each  left-hand  side  is  special.  Its  syntax  is  highly  constrained.  It 
specifies  the  domain  of  applicability  of  the  rule,  by  naming  a  particular  facet  of  a  particular 
concept  to  which  this  rule  is  relevant. 

AM  uses  this  first  conjunct  as  a  fast  "pre-precondition",  so  that  the  only  rules  whose  left- 
hand  sides  get  evaluated  are  already  known  to  be  somewhat  relevant  to  the  task  at  hand.  In 
fact,  AM  physically  attaches  each  rule  to  the  facet  and  concept  mentioned  in  its  first 
conjunct.*  This  will  be  discussed  ih  more  detail  in  Section  4.5,  "Gathering  relevant 
heuristics".  This  first  conjunct  will  always  be  written  out  as  follows,  in  this  document 
(where  A,  F,  and  C  are  specified  explicitly): 

The  current  task  (the  one  just  selected  from  the  agenda)  is  o.'  the  form  "Do  action  A 
to  th«  F  facet  of  concept  C” 

This  can  be  viewed  as  the  "syntax"  of  the  very  first  conjunct  on  each  rule’s  left-hand  side. 
Here  are  two  typical  examples  of  allowable  first  conjuncts: 

•  The  current  task  (the  one  last  selected  from  the  agenda)  is  of  the  form  "Check  the 
Domain/range  facet  of  concept  X",  where  X  is  any  operation 

•  The  current  task  is  of  the  form  "Fillin  the  examples  facet  of  the  Primes  concept" 

These  are  the  only  guidelines  which  the  left-hand  side  of  a  heuristic  rule  must  satisfy.  Any 
LISP  predicate  which  satisfies  these  constraints  is  a  syntactically  valid  left-hand  side  for  a 
heuristic  rule.  It  turned  out  later  that  this  excessive  freedom  made  it  difficult  for  AM  to 
inspect  and  analyze  and  synthesize  its  own  heuristics;  such  a  need  was  not  foreseen  at  the 
time  AM  was  designed. 

Because  of  this  freedom,  there  is  not  much  more  to  say  about  the  left-hand  sides  of  rules. 
As  the  reader  encounters  heuristics  in  the  next  few  sections,  he  should  notice  the 
(unfortunate)  variety  of  conjuncts  which  may  occur  as  part  of  their  left-hand  sides. 


4.1.2.  Syntax  of  the  Right-hand  Side 

"Running"  the  left-hand-side  means  evaluating  the  series  of  conjoined  little  predicates  there, 
to  see  if  they  all  return  True.  If  so,  we  say  that  the  rule  "triggers".  In  that  case,  the  right- 
hand-side  is  "run",  which  means  executing  all  the  actions  specified  there.  A  single  heuristic 
rule  may  have  a  list  of  several  actions  as  its  right-hand-side.  The  actions  are  executed  in 
order,  and  we  then  say  the  rule  has  finished  running. 

Only  the  right-hand-side  of  a  heuristic  rule  is  permitted  to  have  side  effects.  The  right  side 
of  a  rule  is  a  series  of  little  LISP  functions,  each  of  which  is  called  an  action. 
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ConvortoVi  if  i  includo  th*t  conjunct,  I  noodn't  toll  you  whoi  ^  tho  rult  io  otorod. 
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Semanticatly,  each  action  performs  some  processing  which  is  appropriate  in  some  way  to  the 
kinds  of  situations  in  which  the  left-hand-side  would  have  triggered.  The  final  value  that 
the  action  function  returns  is  irrelevant. 

Syntactically,  there  is  only  one  constraint  which  each  function  or  "action"  must  satisfy.  Each 
action  has  one  of  the  following  3  side-effects,  and  no  other  side-effects: 

1.  It  suggests  a  new  task  for  the  agenda. 

2.  It  causes  a  new  concept  to  be  created. 

3.  It  adds  (or  deletes)  a  certain  entry  to  a  particular  facet  of  a  particular  concept. 

To  repeat:  the  right  side  of  a  rule  contains  a  list  of  actions,  each  of  which  is  one  of  the 
above  three  types.  A  single  rule  might  thus  result  in  the  creation  of  several  new  concepts, 
the  addition  of  many  new  tasks  to  the  agenda,  and  the  filling  in  of  some  facets  of  some 
already-existing  concepts. 

These  three  kinds  of  actions  will  now  be  discussed  in  the  following  three  sections. 


4.2,  Heuristics  Suggest  New  Tasks 

This  section  discusses  the  "proposing  a  new  task"  kind  of  action. 

Here  is  the  basic  idea  in  a  nutshell:  The  left-hand-side  of  ?.  rule  triggers.  Scattered  among 
the  "things  to  do"  in  its  right-hand-side  are  some  suggestions  for  future  tasks.  These  new 
tasks  are  then  simply  added  to  the  agenda  list 


4.2.1.  An  Illustration:  "Fill  in  Ceneralhittionf  of  Equality" 

If  a  new  task  is  suggested  by  a  heuristic  rule,  then  that  rule  must  specify  how  to  assemble 
the  new  task,  how  to  get  reasons  for  it,  and  how  to  evaluate  those  reasons.  For  example, 
here  is  a  typical  heuristic  rule  which  proposes  a  new  task  to  add  to  the  agenda.  It  says  to 
generalize  a  predicate  if  it  is  very  rarely®  satisfied: 

If  fhe  current  teak  was  (Fili-in  examples  of  X), 
and  X  is  a  predicate, 

and  more  than  IOC  items  are  known  in  (ho  domain  of  X, 
and  «)  least  iO  epu  seconds  were  spent  trying  to  randomly  instantiate  X, 
and  the  ratio  of  successes/failures  is  both  >0  and  less  than  .05 
Then  add  the  following  task  to  the  tgonda:  (FilHin  generalizations  of  X),  for  tho  following 
reason: 


O 

The  most  tutpiciout  part  of  tha  ailuationa!  fluan*  (tha  IF-part)  ia  th«  numbar  *0S*.  Whara  did  H  com*  from?  Hint:  if  all 
Humana  had  f  finfara,  thia  would  probably  ba  005  in  bata  f.  Sarioualy,  ont  can  chanfa  thia  vakia  (to  .01  or 
to  25)  with  virtually  no  ehanga  in  AhOa  bahavior.  Ihia  ia  tha  concluaion  of  axparimant  3  (aaa  SMtion 
623)  Such  empirical  juatifkalion  ia  ona  important  raaaon  for  Ktually  writinc  and  running  large  progreme 
like  Akl 
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”y,  it  rtr«>ly  ttiisfitd;  •  slightly  less  restrictive  concept  might  be  more  interesting”. 

This  reason's  rating  is  computed  at  three  timet  the  ratio  of  nonexamplet/examples 
lOund. 

Even  this  Is  one  ful!  step  above  the  actual  LISP  implementation,  where  ”X  it  a  predicate" 
would  be  coded  as  "(MEMBER  X  (EXAMPLES  PREDICATE))”.  The  function  EXAMPLES(X) 
rummages  about  looking  for  already-existing  examples  of  X.  Also,  the  LISP  code  contains 
information  for  normalizing  all  the  numbers  produced,  so  that  they  will  lie  in  the  range  0- 
ICOO. 

Let’s  examine  an  instance  of  where  this  rule  was  used.  At  some  point,  AM  chose  the  task 
"Fillin  oxamplot  of  Litt-aquality".  One  of  the  ways  it  filled  in  examples  of  this  predicate  was 
to  run  it  on  pairs  of  randomly-chosen  lists,  and  observe  whether  the  result  was  True  or 
False^  Say  that  244  random  pairs  of  lists  were  tried,  and  only  twice  was  this  predicate 
satisfied.  Sometime  later,  the  IF  part  of  the  above  heuristic  is  examined.  All  the  conditions 
are  met,  so  it  "triggers".  For  example,  the  "ratio  of  successes  to  failures"  is  just  2/242,  which 
is  clearly  greater  than  zero  and  less  than  0.05.  So  the  right-hand-side  (THEN-part)  of  the 
above  rule  is  executed.  The  right-hand  side  initiates  only  one  action:  the  task  "Fillin 
generalizations  of  List-equality"  is  added  to  the  agenda,  tagged  with  the  reason  "List-equality 
is  rarely  satisfied;  a  slightly  less  restrictive  concept  might  be  more  interesting",  and  that 
reason  is  assigned  a  numeric  rating  of  3x(242/2)  ■  363. 

Notice  that  the  heuristic  rule  above  supplied  a  little  function  to  compute  the  value  of  the 
reason.  That  formula  was:  "three  times  the  ratio  of  examples/nonexamples  found".® 
Functions  of  this  type,  to  compute  the  rating  for  a  reason,  satisfy  the  same  constraints  as  the 
left-hand-side  did:  the  function  must  be  very  fast  and  it  must  have  no  side  effects.  The 
"intelligence"  that  AM  exhibits  in  selecting  which  task  to  work  on  ultimately  depends  on  the 
accuracy  of  these  local  rule  evaluation  formulae.  Each  one  is  .so  specialized  that  it  is  "easy” 
for  it  to  give  a  valid  result;  the  range  of  situations  it  must  juw.ge  is  quite  narrow.  Note  that 
these  little  formulae  were  hand-written,  individually,  by  the  author.  AM  wasn’t  able  to 
create  new  little  reason-rating  formulae. 

The  reason-rating  function  is  evaluated  at  the  moment  the  Job  is  suggested,  and  only  the 
numeric  result  is  remembered,  not  the  original  function.  In  other  words,  we  tack  on  a  list  of 
reasons  and  associated  numbers,  for  each  Job  on  the  agenda.  The  agenda  doesn’t  maintain 
copies  of  the  reason-rating  functions  which  gave  those  numbers.  This  simplification  is  used 
merely  to  save  the  system  some  space  and  time. 

Let’s  turn  now  from  the  reason-rating  formulae  to  the  reasons  themselves.  Each  reason 
supporting  a  newly-suggested  Job  is  simply  an  English  sentence  (an  opaque  string,  a  token). 
AM  cannot  do  much  intelligent  processing  on  these  reasons.  AM  is  not  allowed  to  inspect 
parts  of  it,  parse  it,  transform  it,  etc.  The  most  AM  can  do  is  compare  two  such  tokens  for 
equality.  Of  course,  it  is  not  to  hard  to  imagine  this  capability  extended  to  permit  AM  to 


* 

Th«  Tru*  on««  b«c«mt  •xtmptot  of  Li«t-tqutlity,  and  tht  ptira  of  liata  which  didn’t  aatiafy  thia  pradieato  bacamo  known  aa 
noR-axampiaa  (failuraa,  fo^jaa^..).  A  hauriatic  aitnilat  to  thia  "random  inatantiatien"  ona  ia  illualralad  in 
Saction  44,  on  pa(a  4& 

®  In  Ktuality,  thia  would  ba  ehackad  to  anaura  that  tha  raauH  Kaa  batwaan  0  and  1000. 
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syntactically  analyze  such  strings,  or  to  trivially  compute  some  sort  of  "difference"  between 
two  given  reasons.®  Each  reason  is  assumed  to  have  some  semantic  impact  on  the  user,  and 
is  kept  around  partly  for  that  purpose. 

Each  reason  will  have  a  numeric  rating  (a  number  between  0  and  1000)  assigned  to  it 
locally,  by  the  heuristic  rule  which  proposed  the  task  for  that  reason.  One  global  formula 
will  then  combine  all  the  reasons’  ratings  into  one  single  priority  value  for  the  task. 


4.2.2  The  Ratings  Caine 

In  general,  a  task  on  the  agenda  list  will  have  several  reasons  in  support  of  it.  Each  reason 
consists  of  an  English  phrase  and  a  numeric  rating.  How  can  a  task  have  more  than  one 
reason?  There  are  two  contributing  factors:  (i)  A  single  heuristic  rule  can  have  several 
reasons  in  support  of  a  job  it  suggests,  and  (ii)  When  a  rule  suggests  a  "new"  task,  that  very 
same  task  may  already  exist  on  the  agenda,  with  quite  distinct  reasons  tacked  on  there.  In 
that  case,  the  new  reason(s)  are  added  to  the  already-known  ones. 

One  global  formula  looks  at  all  the  ratings  for  the  reasons,  and  combines  them  into  a  single 
priority  value  for  the  task  as  a  whole.  Below  is  that  formula,  in  all  its  gory  detail; 

Worlh(J)  -  ||SQRT(SUM  Rj2)||  x  [  .2xWorth(A)  ♦  .3xWorlh(F)  ♦  .5xWorth(C>] 

Where  J  «  job  to  be  Judged  ■  (Act  A,  Facet  F,  Concept  C) 
and  {Rj}  are  the  ratinfs  of  the  reasons  supporting  J. 


For  example,  consider  the  job  J  -  (Check  examples  of  Primes).  The  act  A  would  be 
"Check",  which  has  a  numeric  worth  of  100.  The  facet  F  would  be  "Examples",  which  has 
a  numeric  worth  of  700.  The  concept  C  would  be  "Primes",  which  at  the  moment  might 
have  Worth  of  800.  Say  there  were  /bur  rea.sons,  having  values  200,  300,  200,  and  500. 
The  double  lines  "||...ir  indicate  normalization,  which  means  that  the  final  value  of  the 
square-root  must  be  between  0  and  I,  which  is  done  by  dividing  the  result  of  the  Square- 
root  by  1000  and  then  truncating  to  1.0  if  the  result  exceeds  unity. 

In  this  case,  we  first  compute  Sqrt(2002  4  300^  4  200^  4  500^)  <■  Sqrt(420,000),  which  is 
about  648.  After  normalization,  this  becomes  0.648.  The  expression  in  square  brackets  in 
the  formula^  is  actually  computed  as  the  dot-product  of  two  vectors®;  in  this  case  it  is  the 
dot-product  of  (100  700  800)  and  (.2  .3  .5),  which  yields  630.  This  is  multiplied  by  the 
normalized  Square-root  value,  0.648,  and  we  end  up  with  a  final  priority  rating  of  408. 

The  four  reasons  each  have  a  fairly  low  priority,  and  the  total  priority  of  the  task  is 


®  It  w  in  fact  trivial  to  IMAGINE  it.  Of  couraa  DOING  it  ia  quita  a  bit  laaa  trivial.  In  fact,  it  probably  ia  tha  toughaat  of  all  th« 
’opan  raaaarch  problamt"  I'll  propota. 

^  Namaly,  (  0.2xWorth(A)  4  O.SxWorlhtF)  4  0.5xWorth(C) ). 

®  Nainaly>  <Worth(A),  Worth(F),  Worth(C)>  and  <  .2,  3,  .5  >.  Tha  dot-product  of  <al  a2  a3..>  and  <bl  b2  b3...>  ia  dafinad 
aa  (at  x  bl)  4  (a2  x  b2)  4  (a3  x  b3)  4_ 
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therefore  not  great.  It  is,  however,  higher  than  any  single  reason  multiplied  by  0.648.  This 
is  because  there  are  many  distinct  reasons  supporting  it.  The  global  formula  uniting  these 
reasons’  values  does  not  simply  take  the  largest  of  them  (ignoring  the  rest),  nor  does  it 
simply  add  them  up. 

The  above  formula  was  intended  originally  as  a  first  pass,  an  ad  hoc  guess,  which  I  expected 
I’d  have  to  modify  later.  Since  it  has  worked  successfully,  I  have  not  messed  with  it.  There 
is  no  reason  behind  it,  no  justification  for  taking  dot-products  of  vectors,  etc.  I  concluded, 
and  recent  experiments  tend  to  confirm,  that  the  particular  form  of  the  formula  is 
unimportant;  only  some  general  characteristics  need  be  present: 

1.  The  priority  value  of  a  task  is  a  monotone  increasing  function  of  each  of  its  reasons’ 

ratings.  If  a  new  supporting  reason  is  found,  the  task's  value  is  increased.  The 
better  that  new  reason,  the  bigger  the  Increase. 

2.  If  an  already-known  supporting  reason  is  re-proposed,  the  value  of  the  task  is  not 

increased  (at  least,  it’s  not  increased  very  much).  Like  humans,  AM  is  fooled 
whenever  the  same  reason  reappears  in  disguised  form. 

3.  The  priority  of  a  task  involving  concept  C  should  be  a  monotone  increasing 
function  of  the  overall  worth  of  C.  Two  similar  tasks  dealing  with  two  different 
concepts,  each  supported  by  the  same  list  of  reasons  and  reason  ratings,  should  be 
ordered  by  the  worth  of  those  two  concepts. 

I  believe  that  all  of  these  criteria  are  absolutely  essential  to  good  behavior  of  the  system. 
Several  of  the  experiments  discussed  later  bear  on  this  question  (See  Section  6.2,  page 
125).  Note  that  the  messy  formula  given  on  the  last  page  does  incorporate  all  3  of  these 
constraints.  In  addition,  there  are  a  few  features  of  that  formula  which,  while  probably  not 
necessary  or  even  desirable,  the  reader  should  be  informed  of  explicitly: 

1.  The  task’s  value  does  not  depend  on  the  order  in  which  the  reasons  were  discovered. 

This  is  not  true  psychologically  of  people,  but  it  is  a  feature  of  the  particular 
priority-estimating  formula  initially  selected. 

2.  Two  reasons  are  either  consiuered  identical  or  unrelated.  No  attempt  is  made  to 
reduce  the  priority  value  because  several  of  the  reasons  are  overlapping 
semantically  or  even  just  syntacticaly.  This,  too,  is  no  doubt  a  mistake. 

3.  There  is  no  need  to  keep  around  all  the  individual  reasons’  rating  numbers.  The 

addition  of  a  new  reason  will  demand  only  the  knowledge  of  the  number  of  other 
reasons,  and  the  old  priority  value  of  the  task. 

4.  A  task  with  no  reasons  gets  an  absolute  zero  rating.  As  new  reasons  are  added,  the 

priority  *!owly  increases  toward  an  absolute  maximum  which  is  dependent  upon 
the  overall  worth  of  the  concept  and  facet  involved. 

There  is  one  topic  of  passing  interest  which  should  be  covered  here.  Each  possible  Act  A 
(eg.,  Fillin,  Check,  Apply)  and  each  possible  facet  F  (e.g..  Examples,  Definition,  Name(s))  is 
assigned  a  fixed  numeric  value  (by  hand,  by  the  author).  These  values  are  used  inside  the 
formula  on  the  last  page,  where  it  says  ‘Worth(A)’  and  'Worth(F)’.  They  are  fairly  resistant 
to  change,  but  certain  orderings  should  be  maintained  for  best  results.  E.g.,  "Examples" 
should  be  rated  higher  than  "Specializations",  or  else  AM  may  whirl  away  on  a  cycle  of 
specialization  long  after  the  concept  has  been  constrained  Into  vacuousness.  As  for  the  Acts, 
their  precise  values  turned  out  to  be  even  less  Important  than  the  Facets’. 

Now  that  we’ve  seen  how  to  compute  this  priority  value  for  any  given  task,  let’s  not  forget 
what  it’s  used  for.  The  overall  rating  has  two  functions: 
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(I)  The  tasks  on  the  agenda  list  are  ordered  by  their  ratings,  and  AM  always  chooses 
the  top  task.  Thus  this  rating  determines  which  task  to  execute  next.  This  is  not  an 
ironclad  policy;  In  reality,  AM  prints  out  the  top  few  tasks,  and  the  user  has  the 
option  of  interrupting  and  directing  AM  to  work  on  one  of  those  other  tasks 
instead  of  the  very  top  one. 

(ii)  Once  a  task  is  chosen,  its  overall  rating  determines  how  much  time  and  space  AM 
will  expend  on  it  before  quitting  and  moving  on  to  a  new  task.  The  precise 
formulae  ^re  unimportant.  Roughly,  the  0-1000  rating  is  divided  by  ten  to 
determine  how  much  time  to  allow,  in  cpu  seconds.  The  rating  is  divided  by  two  to 
determine  how  much  space  to  allow,  in  list  cells. 


Recall  that  a  heuristic  rule’s  actions  are  of  three  types; 

1 .  Suggest  new  tasks  and  add  them  to  the  agenda. 

2.  Create  a  new  concept. 

3.  Fill  in  some  entries  for  a  facet  of  a  concept. 

This  subsection  discusses  the  second  activity. 

Here  is  the  basic  idea  in  a  nutshell;  Scattered  among  the  "things  to  do"  in  the  right-hand- 
side  of  a  rule  are  some  requests  to  create  .specific  new  concepts.  For  each  such  request,  the 
heuristic  rule  must  specify  how  to  construct  it.  At  least,  the  rule  must  specify  ways  of 
assembling  enough  facets  of  the  new  concept  to  disambiguate  it  from  all  the  other  known 
concepts.  Typically,  the  rule  will  explain  how  to  fill  in  the  Definition  of  -  or  an  Algorithm 
for  -  the  new  concept.  After  executing  these  instructions,  the  new  concept  will  "exist",  and 
a  few  of  its  facets  will  be  filled  in,  and  a  few  new  jobs  will  probably  exist  on  the  agenda, 
indicating  that  AM  might  want  to  fill  in  certain  other  facets  of  this  new  concept  In  the  near 
future. 


4.3.1.  An  Illustration:  Discovering  Primes 

Here  is  a  heuristic  rule  that  results  in  a  new  concept  being  created; 

If  (h«  current  task  was  (Fill-in  examples  of  F), 

and  F  is  an  operation  from  domain  space  A  into  range  space  B, 

and  more  than  100  items  are  Known  examples  of  A  (in  the  domain  of  F), 

and  more  than  10  range  items  (in  B)  were  found  by  applying  F  to  these  domain  items, 

and  at  least  1  of  these  range  items  is  a  distinguished  member  (esp:  extremum)^  of  B, 
Then  (for  each  such  distinguished  member  V<B}  create  the  following  new  concept: 


g 

Tht«  i«  handled  ««  followt.’  AM  tikef  the  f iven  lift  of  range  ilema  It  eliminatea  any  which  are  not  intareatinf  (according  to 
Intaratta(B))  or  extrama  (an  entry  on  B.Exa-Bdy,  the  boundary  examplet  of  B).  Finally,  all  thoae  extreme 
range  itemt  are  moved  to  the  front  of  thia  liat.  AM  bagint  walking  down  this  liat,  creating  new  concepta 
according  to  the  rule  Sooner  or  later,  a  timer  (or  a  atoraga-apaca-watcher)  will  terminate  thia  coatly 
activity.  Only  the  frontmoat  few  range  Hema  on  the  liat  will  have  generated  new  concepta.  So  ’eapecic'ly” 
really  juat  maana  priority  conaidaration. 
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Nam«:  F-Inverse-of-b 
Definition:  (x)  (  F(x)  is  b  ) 

Generalization:  A 

Worth:  Average(Worth(A),  Worth(F),  Worth(B),  Worth(b),  ||Examples(B)||) 
Interest:  Any  conjecture  involving  both  this  concept  and  either  F  or  Inverse(F) 


In  case  the  user  asks,  the  reason  for  doing  this  is:  "Worthwhile  investigating  those  A’s  which 
have  an  unusual  F-value,  namely,  those  whose  F-value  is  b" 

The  total  amount  of  time  to  spend  right  now  on  all  of  these  new  concepts  is  computed  as: 

Half  the  remaining  epu  time  in  the  current  task's  time  quantum. 

The  total  amount  of  space  to  spend  right  now  on  each  of  these  new  concepts  is  computed  as: 
The  remaining  space  quantum  for  the  current  task. 


Although  some  examples  of  F-inverse-of-b  might  be  easily  obtained  (or  already  known)  at 
the  moment  of  its  creation,  the  above  rule  doesn’t  specifically  tell  AM  how  to  fill  in  that 
facet.  The  very  last  line  of  the  heuristic  indicates  that  a  few  epu  seconds  may  be  spent  on 
just  this  sort  of  activity:  filling  in  facets  of  the  new  concept  which,  though  not  explicitly 
mentioned  in  the  rule,  are  easy  to  fill  in  now.  Any  facet  X  which  didn’t  get  filled  in  "right 
now"  will  probably  cause  a  new  task  to  be  added  to  the  agenda,  of  the  form:  "Fillin  fsc«t  X  of 
conetpf  F"invrss«of-b*.  Eventually,  AM  would  choose  that  task,  and  spend  a  large 
quantum  of  time  working  on  that  single  facet. 

Heuristics  for  the  new  concept  are  quite  hard  to  fill  in.  This  was  one  of  AM’s  most  serious 
limitations,  in  fact  (see  Chapter  7).  Above,  we  see  a  trivial  kind  of  "heuristic  schema"  or 
template,  which  gets  instantiated  to  provide  one  new,  specialized  heuristic  about  the  new 
concept.  That  new  heuristic  tells  how  to  Judge  the  interestingness  of  any  conjecture  which 
crops  up  involving  this  new  concept.  Whenever  such  conjectures  get  proposed,  they  are 
evaluate  by  calling  on  just  such  heuristics. 

Now  let’s  look  at  an  instance  of  when  this  heuristic  was  used.  At  one  point,  AM  was 
working  on  the  task  "Fill-in  examples  of  Divisors-of". 

This  heuristic’s  IF-part  was  triggered  because:  Divisors-of  is  an  operation  (from  Numbers  to 
Sets  of  numbers),  and  far  more  than  100  different  numbers  are  known,  and  more  than  10 
different  sets  of  factors  were  found  altogether,  and  some  of  them  were  distinguished  by 
being  extreme  kinds  of  sets:  empty-sets,  singletons,  doubletons  and  tripletons. 

After  its  left  side  triggered,  the  right  de  of  the  heuristic  rule  was  executed.  Namely,  four 
new  concepts  were  created  immediately.  Here  is  one  of  them: 
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Nam*:  Divi«or$-o<-lnv*r8#-of-Doubl«ton 
Definition:  X  (x)  (  Divitorfol(x)  it  a  Doubleton  ) 

Generalization:  Numbers 
Worth:  100 

Interest:  Any  conjecture  involvine  both  this  concept  and  either  Divisors-of  or  Times 


This  is  a  concept  representing  a  certain  class  of  numbers,  in  fact  the  numbers  we  call 
primes.  The  heuristic  resets  a  certain  variable,  so  that  in  case  the  user  interrupts  and  asks 
Why},  AM  Informs  him: 

"This  concept  was  created  because  it’s  worthwhile  investigating  those  numbers  which 
have  an  extreme  divisors-of  value;  in  this  case,  numbers  which  have  only  two  divisors". 


AM  was  willing  to  spend  half  the  remaining  quantum  of  time  allotted  to  "Fillin  oxamplot  of 
Divitort-of*  on  these  four  new  concepts'®. 

The  heuristic  rule  is  applicable  to  any  operation,  not  just  numeric  ones.  For  example,  when 
AM  was  hlting  in  examples  of  Set-Intersection,  it  was  noticed  that  some  pairs  of  sets  were 
mapped  Into  the  extreme  kind  of  set  Empty-set.  The  above  rule  then  had  AM  define  the 
concept  of  Disjointness-,  pairs  of  sets  having  empty  intersection. 


4.3.2.  The  Theory  of  Creating  New  Concepts 

All  the  heuristic  rule  must  do  is  to  fill  in  enough  facets  so  that  the  new  concept  is 
disambiguated  from  all  the  others,  so  that  it  is  "defined"  clearly.  Should  AM  pause  and  fill 
in  lots  of  facets  at  that  time?  After  all,  several  pieces  of  information  are  trivial  to  obtain  at 
this  moment,  but  may  be  hard  to  reconstruct  later  (e.g.,  the  reason  why  C  was  created).  On 
the  other  hand,  filling  in  anything  without  a  good  reason  is  a  bad  idea  (it  uses  up  time  and 
space,  and  it  won’t  dazzle  the  user  as  a  brilliant  choice  of  activity). 

So  the  universal  motto  of  *  M  is  to  fill  In  facets  of  a  new  concept  If  -  and  only  if  -  that 
filling-in  activity  will  be  mi:ch  easier  at  that  moment  than  later  on. 

In  almost  all  cases,  the  following  facets"  will  be  specified  explicitly  in  the  heuristic  rule,  and 
thus  will  get  filled  in  right  away:  Definitions,  Algorithms,  Domain/range,  Worth,  plus  a  tie  to 


SoTM  trK/i«l  dttail*.  0nt-ti|hth  of  tht  r«mainifl{  timt  i*  tpanf  on  uch  of  thott  4  concoptt:  Numbtro-wifh-O-divitoro, 
Numbtrt-with- 1 -divisor,  Numbtrt-ws<h-2-divitor»,  Numb«r«-wi1h-3-divitort.  Tht  orifintl  timo/tpac*  limit* 
wtr*  in  rtality  about  25  epu  ttcond*  and  800  liat  ctlls,  and  at  th*  momtnt  thi*  h«uristie  wa*  calltd,  only 
about  to  aaconda  and  600  call*  ramainad,  *o  *|.  tha  concapt  Prima*  wa*  allottad  only  1.2  epu  aacorid*  to 
*Sat  off  tha  ground".  This  was  no  problem,  at  it  used  far  less  than  that.  The  hauriatic  rule  atatas  that  each 
of  th*  four  naw  concapt*  may  uaa  up  th*  full  ramaining  apse*  allocation  (600  call*),  and,  a.g..  Prim** 
naadad  only  a  fraction  of  that  initially. 

' '  Th*  raadar  may  wiah  to  glanc*  ahaad  to  Saction  5.2,  pag*  67  to  not*  th*  full  rang*  of  facat*  that  any  concapt  may 
poas***:  what  thair  nama*  ara,  and  th*  kind  of  information  that  i*  atorad  in  aach. 
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some  related  concept  (e.g.,  if  the  new  concept  is  a  generalization  of  Equality,  then  we  can 
trivially  fill  in  an  entry  on  its  Specializations  facet:  "Equality".) 

On  the  other  hand,  the  following  facets  will  not  be  trivial  to  fill  in:  Conjectures,  Examples, 
Generalizations,  Specializations,  and  Interestingness.  For  example,  filling  in  the 
Specializations  facet  of  a  new  concept  may  involve  creating  some  new  concepts;  finding  some 
entries  for  its  Conjectures  facet  may  involve  a  great  deal  of  experimenting;  finding  some 
Examples  of  it  may  involve  twisting  its  definition  around  or  searching.  None  of  these  is 
easier  to  do  at  time  of  creation  than  any  other  time,  so  it’s  deferred  until  some  reason  for 
doing  it  exists. 

For  each  such  "time-consuming"  facet  F,  of  the  new  concept  C,  one  new  task  gets  added  to 
the  agenda,  of  the  form  "Fill  in  tnirias  for  facet  F  of  concept  C",  with  reasons  of  the  form 
"Because  C  was  just  created,"  and  also  "No  entries  exist  so  far  on  C.F"'^  Most  of  the  tasks 
generated  this  way  will  have  low  priority  rating,  and  may  stay  near  the  bottom  of  the 
agenda  until/unless  they  are  re-suggested  for  a  new  reason. 

Using  the  Primes  example,  from  the  last  subsection,  we  see  that  a  new  task  like  "Fillin 
tptelalizaliont  of  Primal"  was  suggested  with  a  low  rating,  and  "Fillin  •xamplat  of  Primot"  was 
suggested  with  a  mediocre'®  rating.  The  ratings  of  these  tasks  Increase  later  on,  when  the 
same  tasks  are  re-proposed  for  new  reasons. 


4.S.S.  Another  Illustration:  Squaring  a  number 

Let’s  take  another  simple  (but  not  atypical)  illustration  of  how  new  concepts  get  created. 
(The  reader  may  skip  this  subsection;  it  contains  more  details  about  how  AM  actually  sets 
up  new  concepts.) 

Assume  that  AM  has  recently  discovered  the  concept  of  multiplication,  which  it  calls 
TIMES,"  and  AM  decides  that  it  is  very  interesting.  A  heuristic  rule  exists  which  says:'’’ 

If  a  nawly-intarastins  operation  F(x,y)  takes  a  pair  of  N's  as  arguments, 

Then  create  a  new  concept,  a  specialization  of  F,  called  F-ltself,  taking  just  one  N  as 
argument,  defined  as  F(x,x),  with  initial  worth  Wcrth(F). 

’ )  the  case  of  F  ■  TIMES,  we  see  that  F  takes  a  pair  of  numbers  as  its  arguments,  so  the 
heuristic  rule  would  have  AM  create  a  new  concept  called  TIMES-Itself,  defined  as  TIMES- 
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proettt. 

14 
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Itself(x)  ■  TIMES(x,x).  That  is,  create  the  new  concept  which  is  the  operation  of  squaring  a 
number. 


What  would  AM  do  in  this  situation?  The  global  list  of  concepts  would  be  enlarged  to 
include  the  new  atom  TIMES-Itself’,  and  the  facets  of  this  new  concept  would  begin  to  be 
filled  in.  The  following  facets  would  get  filled  in  almost  instantly: 


The  name,  definition,  domain/range,  generalizations,  and  worth  are  specified  explicitly  by 
the  heuristic  rule. 

The  lambda  expression  stored  under  the  definition  facet  is  an  executable  LISP  predicate, 
which  accepts  two  arguments  and  then  tests  them  to  see  whether  the  second  one  is  equal  to 
TIMES-Itself  of  the  first  argument.  It  performs  this  test  by  calling  upon  the  predicate 
stored  under  the  definition  facet  of  the  TIMES  concept.  Thus  TIMES-Itself.Defn(4,l6)  will 
call  on  TIMES.Defn('i,4,16),  and  return  whatever  value  that  predicate  returns  (in  this  case, 
it  returns  True,  since  4x4  does  equal  16). 

A  trivial  transformation  of  this  definition  provides  an  algorithm  for  computing  this 
operation.  The  algorithm  says  to  call  on  the  Algorithms  facet  of  the  concept  TIMES.  Thus 
TIMES-Itself.Alg(4)  is  computed  by  calling  on  TIMES.AIg(4,4)  and  returning  that  value 
(namely,  16). 

The  worth  of  TIMES  was  800  at  the  moment  TIMES-Itself  was  created,  and  this  becomes 
the  worth  of  TIMES-Itself. 

TIMES-Itself  Is  by  definition  a  specialization  of  TIMES,  so  the  SPECIALIZATIONS  facet 
of  TIMES  Is  incremented  to  point  to  this  new  concept.  Likewise,  the 
GENERALIZATIONS  facet  of  TIMES-Itself  points  to  TIMES. 

Note  how  easy  it  was  to  fill  in  these  facets  now,  but  how  difficult  it  might  be  later  on,  "out  of 
context".  By  way  of  contrast,  the  task  of,  e.g.,  filling  in  SpeclaUzutions  of  TIMES-Itself  will 
be  no  harder  later  on  than  it  is  right  now,  so  we  may  as  well  defer  it  until  there’s  a  good 
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reason  for  it.  This  task  will  probably  be  added  to  the  agenda  with  so  low  a  priority  that 
AM  will  never  get  around  to  it,  unless  some  new  reasons  for  it  emerge. 

The  task  "Fill-in  •xamples  of  TIMES-IUoll"  is  probably  worthwhile  doing  soon,  but  again  it 
won't  be  any  harder  to  do  at  a  later  time  than  it  is  right  now.  So  it  is  not  done  at  the 
moment;  rather,  it  is  added  to  the  agenda  (with  a  fairly  high  priority). 

Incidentally,  the  reader  may  be  Interested  to  know  that  the  risxt  few  tasks  AM  selected  (in 
reality)  were  to  create  the  inverse  of  this  operation  (i.e.,  integer  square-root),  and  then  to 
create  a  new  kind  of  number,  the  ones  which  can  be  produced  by  squaring  (i.e.,  perfect 
squares).  Perfect  squares  were  deemed  worth  having  around  because  Integer-square-root  is 
defined  precisely  on  that  set  of  integers. 


4A  Heuristics  Fill  in  Entries  for  a  Specific  Facet 

The  last  two  subsections  dealt  with  how  a  heuristic  rule  is  able  to  propose  new  tasks  and 
create  new  concepts.  This  section  will  illustrate  how  a  rule  can  find  some  entries  for  a  given 
facet  of  a  specific  concept. 

Typically,  the  facet/concept  involved  will  be  the  one  mentioned  in  the  current  task  which 
was  chosen  from  the  agenda.  If  the  task  "Flllin  Examplts  of  Sol-union"  were  plucked  from  the 
agenda,  then  the  "relevant"  heuristics  would  be  those  useful  for  filling  In  entries  for  the 
Examples  facet  of  the  Set-union  concept. 

There  Is  an  Important  class  of  exceptions  to  this,  however:  conjectures.  Some  rules  will 
specify  plausible  relationships  to  look  for;  If  found,  they  constitute  a  new  conjecture.  For 
example,  the  reader  will  see  In  Section  4.4.4,  on  page  62,  that  the  unique  factorization 
theorem  is  proposed  merely  as  an  observation  of  the  form  "The  range  of  operation  F  is  not 
just  B  but  rather  the  more  specialized  concept  BB".  The  particular  case  of  the  unique 
factorization  theorem  leads  to  this  statement:  "The  range  of  Prime-factorings'®  is  not  just 
"Sets’  but  rather  “Singletons’."  In  fact,  this  whole  conjecture  Is  recorded  by  merely  replacing 
<Number-»Set>  by  <Number-*Singleton>  as  an  entry  on  the  Domain/range  facet  of  the 
concept  Prime-factorings. 

The  reader  may  be  surprised  to  learn  that  the  only  kind  of  conjecture  AM  can  make  is  of 
that  form  (add  a  new  entry  to  some  facet  of  some  concept)'®.  Apparently,  this  is  sufficient  to 
plausibly  notice  and  state  most  interesting  conjectures.  Good  definitiohs  make  the  sta'.ements 
of  theorems  short  and  simple.'^ 


'®  Prim«-ftctorinf«(x),  «l*o  ctlltd  Prlm«-litn«t(x),  w  th«  Ml  of  til  bt|«-of-prim««  whoio  product  io  x;  i.o.,  til  wtyo  of 
ftetoring  x  into  primtt. 

'®  That’t  why  'conjocturinf*  it  eitttifitd  under  tht  'tdd-tn-tntry’  typo  of  nturittic  rule  action. 

'  Extreita  for  tht  doubtinc  rtidtr;  Stata  tha  uniqua  factorization  thaoram  in  puraly  aat-thaoratic  tarma.  Sarioutly,  ona 
important  way  that  dafinitiona  art  invantad  ia  to  aaa  what  bulky  conatruct  in  a  thaoram  can  ba  collapa^  into 
a  ainfla  tarm.  Typically  ona  hopaa  that  tha  tarm  will  ba  utad  aittwhart,  of  eouraa. 
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We'll  take  thPie  two  kinds  of  "filling  in  entries"  one  at  a  time:  first  the  standard  "find  an 
entry  for  the  facet  of  the  concept  mentioned  in  the  current  task",  followed  by  the  interesting 
but  rarer  activity  of  "looking  for  a  conjecture". 


4.4.1.  An  Illustration;  "Fill  in  Examples  of  Set-union" 

Recall  that  a  task  is  typically  of  the  form  "Fill  in  facet  F  of  concept  C".  How  can  executing 
relevant  heuristic  rules  satisfy  such  a  task?  This  subsection  illustrates  how  a  heuristic  rule 
might  be  executed  to  find  some  entries  for  the  facet  designated  by  the  current  task. 

A  typical  heuristic,  attached  to  the  concept  Activity,  says: 

1  It 

If  th«  current  task  is  to  fill  in  examples  of  the  activity  F, 

One  way  to  get  them  is  to  run  F  on  randomly  chosen  examples  of  the  domain  of  F. 

Of  course,  in  the  LISP  implementation,  this  situation-action  rule  is  not  coded  quite  so 
neatly,  it  would  be  more  faithfully  translated  as  follows: 

If  CURRENT-TASK  matches  (FILLIN  EXAMPLES  F*-anything)), 
and  F  isa  Activity, 

Then  carry  out  the  following  procedure: 

1 .  Find  the  domain  of  F,  and  call  it  D$ 

2.  Find  examples  of  D,  and  call  them  E; 

3.  Find  an  algorithm  to  compute  F,  and  call  it  A; 

4.  Repeatedly: 

4a.  Choose  any  member  of  E,  and  call  It  El. 

4b.  Run  A  on  and  call  the  result  X. 

4c.  Check  whether  <E1,X>  satisfies  the  definition  of  F. 

4d.  If  so,  then  add  <E1  X>  to  the  Examples  facet  of  F. 

4e.  If  not,  then  add  <E1  -»  X>  to  the  Non-examples  facet  of  F. 

Let's  take  a  particular  Instance  where  this  rule  would  be  useful.  Say  the  current  task  is  "Fillin 
examples  of  Set-union".  The  left-hand-side  of  the  rule  is  satisfied,  so  the  right-hand-side  is 
run. 

Step  (1)  says  to  locate  the  domain  of  Set-union.  The  facet  labelled  Domain/Range,  on  the 
Set-union  concept,  contains  the  entry  (SET  SET  -♦  SET),  which  indicates  that  the  domain  is 
a  pair  of  sets.  That  is.  Set-union  is  an  operation  which  accepts  (as  its  arguments)  a  pair  of 
sets,  and  returns  (as  its  value)  some  new  set. 

Since  the  domain  elements  are  sets,  step  (2)  says  to  locate  examples  of  sets.  The  facet 
labelled  Examples,  on  t.ie  Sets  concept,  points  to  a  list  of  about  30  different  sets.  This 
includes  {Z},  {A,B,C,D.E},  {},  {A.{{B)}}.... 

Step  (3)  involves  nothing  more  than  accessing  some  randomly-chosen  entry  on  the 
Algorithms  facet  of  Set-union.  One  such  entry  is  a  recursive  LISP  function  of  two 
arguments,  which  halts  when  the  first  argument  is  the  empty  set,  and  otherwise  pulls  an 


18 


'Activity*  i*  •  |»n«rtl  concept  which  iriclud**  op«r*tion«,  pr*dicit««,  r*l«tion«,  function*,  *tc 


Ch«pt*r  4 


AM:  Oitcovtry  in  Mith«m*tic*  *«  H«uri«tic  S««rch 


-49- 


element  out  of  that  set  and  SET-INSERT’s  it  into  the  second  argument,  and  then  recurses 
on  the  new  values  of  the  two  sets.  For  convenience,  we'll  refer  to  this  algorithm  as  UNION. 

We  then  enter  the  loop  of  Step  (4).  Step  (4a)  has  us  choose  one  pair  of  our  examples  of 
sets,  say  the  first  two  {Z}  and  {A,B,C,D,E}.  Step  (4b)  has  us  run  UNION  on  these  two  sets. 
The  result  Is  {A,B,C,D,E,Z}.  Step  (4c)  has  us  grab  an  entry  from  the  Definitions  facet  of 
Set-union,  and  run  it.  A  typical  definition  is  this  formal  one; 

(\  (SI  S2  $3) 

(AND 

(For  all  X  in  Si,  X  is  in  S3) 

(For  all  X  in  S2,  x  is  in  S3) 

(For  all  X  in  S3,  x  is  in  SI  or  x  is  in  S2) 

) 

)  ) 


It  is  run  on  the  three  arguments  S1«{Z},  S2-{A,B,C,D,E},  S3«{A,B,C,D,E,Z}.  Since  it  returns 
"True",  we  proceed  to  Step  (4d).  The  construct  <{Zj,  {A,B,C,D,E}  -♦  {A,B,C,D,E,Z}>  is  added 
to  the  Examples  facet  of  Set-union. 

At  this  stage,  control  returns  to  the  beginning  of  the  Step  (4)  loop.  A  new  pair  of  sets  is 
chosen,  and  so  on. 

But  when  would  this  loop  stop?  Recall  that  each  task  has  a  time  and  a  space  allotment 
(based  on  its  priority  value).  If  there  are  many  different  rules  all  claiming  to  be  relevant  to 
the  current  task,  then  each  one  is  allocated  a  small  fraction  of  those  time/space  quanta. 
When  either  of  these  resources  is  exhausted,  AM  would  break  away  at  a  "clean"  point  (Just 
after  finishing  a  cycle  of  the  Step  (4)  loop)  and  would  m  on  to  a  new  heuristic  rule  for 
filling  in  examples  of  Set-union. 

This  concludes  the  demonstration  that  a  heuristic  rule  really  can  be  executed  to  produce  the 
kinds  of  entities  requested  by  the  current  task. 


4.4.2.  Heuristics  Propose  New  Con  ie 


We  saw  in  the  sample  excerpt  (Chapter  2)  that  AM  occasionally  notices  some  unexpected 
relationship,  and  formulates  it  into  a  precise  conjecture.  Below  is  an  example  of  how  this  is 
done.  As  you  might  guess  from  the  placement  of  this  subsection,'^  the  mechanis  is  our 
old  friend  the  heuristic  rule  which  fills  in  entries  for  certain  facets. 


In  fact,  a  conjecture  evolves  through  four  stages: 

1.  A  heuristic  rule  looks  for  a  particular  kind  of  relationship.  This  will  typically  be  of 
the  form  "X  is  a  Generalization  of  Y",  or  "X  is  an  example  of  Y",  or  "X  is  the 
same  as  Y",  or  "Fl.Defn(X,Y)"  where  FI  is  an  active  concept  AM  knows  about,  or 
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"Fl.Defn(Y.X)"2° 

2.  Once  found,  the  relationship  is  checked,  using  supporting  contacts.  A  great  deal  of 

empirical  evidence  must  favor  it,  and  any  contradictory  evidence  must  be 
"explained  away"  somehow. 

3.  Now  it  is  believed,  and  AM  prints  it  out  to  the  user.  It  is  added  as  a  new  entry  to 

the  Conjees  facet  of  both  concepts  X  and  Y.  It  is  also  added  as  an  entry  to  the 
Examples  facet  of  the  Conjecture  concept. 

4.  Eventually,  AM  will  get  around  to  the  task  "Check  Examples  of  Conjecture",  or  to  the 

task  "Check  Conjees  of  X".  If  AM  had  any  concepts  for  proving  conjectures,  they 
would  then  be  invoked.  In  the  current  LISP  implementation,  these  are  absent. 
Nevertheless,  several  "checks"  are  performed:  (t)  see  if  any  new  empirical  evidence 
(pro  or  con)  has  appeared  recently;  (it)  see  if  this  conjecture  can  be  strengthened; 
(tii)  check  it  for  extreme  cases,  and  modify  it  if  necessary;  (Iv)  Modify  the  worth 
ratings  of  the  concepts  involved  in  the  conjecture. 

The  left-hand-side  of  such  a  heuristic  rule  will  be  longer  and  more  complex  than  most  other 
kinds,  but  the  basic  activities  of  the  right-hand-side  will  still  be  filling  in  an  entry  for  a 
particular  facet. 

The  entries  filled  in  will  include;  (i)  a  new  example  of  Conjectures,  (ti)  a  new  entry  for  the 
Conjee  facet  of  each  concept  involved  in  the  conjecture,  {Hi)  if  we’re  claiming  that  concept 
X  is  a  generalization  of  concept  Y,  then  "X"  would  be  added  to  the  Generalizations  facet  of 
Y,  and  "Y"  added  to  the  Specializations  facet  of  X,  {iv)  if  X  is  an  Example  of  Y,  "X"  is 
added  to  the  Examples  facet  of  Y,  and  "Y"  is  added  to  the  ISA  facet  of  X. 

The  right-hand-side  may  also  involve  adding  new  tasks  to  the  agenda,  creating  new 
concepts,  and  modifying  entries  of  particular  facets  of  particular  concepts.  As  is  true  of  all 
heuristic  rules,  both  sides  of  this  type  of  conjecture-perceiving  rule  may  run  any  little 
functions  they  want  to:  any  functions  which  are  quick  and  have  no  side  effects  (e.g., 
FORALL  tests,  PRINT  functions,  accesses  to  a  specified  facet  of  some  concept). 


4.4.S.  An  liiustration:  "All  primes  except  2  are  odd" 

As  an  illustration,  here  is  a  heuristic  rule,  relevant  when  checking  examples  of  any  concept: 
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ll  lh«  current  task  is  to  Check  Examples  of  X, 
and  (Forsome  Y)  Y  is  a  generalization  of  X, 
and  Y  has  at  least  10  examples, 

and  alt  examples  of  Y  (ignoring  boundary  cases)  are  also  examples  of  X, 

Then  print  the  following  conjecture:  X  is  really  no  more  specialized  than  Y, 
and  add  it  to  the  Examples  facet  of  Conjectures, 

and  if  the  user  asks,  inform  him  that  the  evidence  for  this  was  that  all  ||Examples(Y}||  Y's 
(ignoring  boundary  examples  of  Y's)  turned  out  to  be  X't  as  well, 
and  Check  the  truth  of  this  conjecture  on  boundary  examples  of  Y, 
and  add  "X"  to  the  Generalizations  facet  of  Y, 
and  add  "Y"  to  the  Specializations  facet  of  X, 

and  (if  there  is  an  entry  in  the  Generalizations  facet  of  Y)  add  the  following  task  to  the 
agenda  "Check  examples  of  Y",  for  the  reason:  "Just  as  Y  was  no  more 
general  than  X,  one*of  Generalizations(Y)  may  turn  out  to  be  no  more 
general  than  Y",  with  a  rating  for  that  reason  computed  as: 
0.4x||Examples(Generalizationc(Y))||  ♦ 

0.3xjjExamples(Y)i|  * 

0.3xPriority(Current  task). 

Let’s  take  a  particular  Instance  where  this  rule  would  be  useful.  Say  the  current  task  is 
"Check  examples  of  Odd-primes".  The  left-hand-side  of  the  rule  is  run,  and  is  satisfied  when 
the  generalization  Y  Is  the  concept  "Primes".  Let’s  see  why  this  is  satisfied. 

One  of  entries  of  the  Generalization  facet  of  Odd-primes  is  "Primes".  AM  grabs  hold  of 
the  30  examples  of  primes  (located  on  the  Examples  facet  of  Primes),  and  removes  the  ones 
which  are  tagged  as  boundary  examples  ("2"  and  "3").  A  definition  of  Odd-prime  numbers 
is  obtained  (Definitions  facet  of  Odd-primes),  and  it  is  run  on  each  remaining  example  of 
primes  (5,  7,  11,  13,  17,  ...).  Sure  enough,  they  ail  satisfy  the  definition.  So  all  primes 
(ignoring  boundary  cases)  appear  to  be  odd.  The  left-hand-side  of  the  rule  is  satisfied. 

At  this  point,  the  user  sees  a  message  of  the  form  "Odd-primes  is  really  no  more  specialized 
than  Primes".  If  he  interrupts  and  asks  about  It,  he  is  told  that  the  evidence  for  this  was 
that  all  30  primes  (ignoring  boundary  examples  of  primes)  turned  out  to  be  Odd-primes. 

Of  the  boundary  e:.amples  (the  numbers  2  and  3),  only  the  integer  "2"  falls  to  be  an  odd- 
prime,  so  the  the  user  Is  notified  of  the  finalized  conjecture:  "All  primes  (other  than  *2’)  are 
also  odd-primes".  Tnis  Is  added  as  an  entry  on  the  Examples  facet  of  the  concept  named 
‘Conjectures.’ 

Before  beginning  all  this,  the  Generalizations  facet  of  Odd-primes  pointed  to  Primes.  Now, 
this  rule  has  us  add  "Primes"  as  an  entry  on  the  Specializations  facet  of  Odd-primes.  Thus 
Primes  is  both  a  generalization  and  a  specialization  of  Odd-primes  (to  within  a  single  stray 
exception),  and  AM  will  be  able  to  treat  these  two  concepts  as  if  they  were  merged  together. 
They  are  still  kept  separate,  however.  In  case  AM  ever  needs  to  know  precisely  what  the 
difference  between  them  is,  or  In  case  later  evidence  shows  the  conjecture  to  be  false^'. 
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The  final  action  of  the  right-hand-side  of  this  rule  is  to  propose  a  new  task  (if  there  exist 
some  generalizations  of  the  concept  Y,  which  in  our  case  is  "Primes").  So  AM  accesses  the 
Generalizations  facet  of  Primes,  which  is  "(Numbers)".  A  new  task  is  therefore  added  to  the 
agenda:  "Check  examples  of  Primes",  with  an  associated  reason:  "Just  as  Primes  was  no  more 
general  than  Odd-primes,  so  Numbers  may  turn  out  to  be  no  more  general  than  Primes". 
The  reason  is  rated  according  to  the  formula  given  in  the  rule;  say  it  gets  the  value  500. 

To  make  this  example  a  little  more  interesting,  let’s  suppose  that  the  task  "Chock  oxampios  of 
Primos"  already  existed  on  the  agenda,  but  for  the  reasoin  "Many  examples  of  Primes  have 
been  found,  but  never  checked",  with  a  rating  for  the  reason  of  100,  and  for  the  task  as  a 
whole  of  200.  The  global  task-rating  formula  then  assigns  the  task  a  new  overall  priority  of 
600,  because  of  the  new,  fairly  good  reason  supporting  it. 

When  that  task  is  eventually  chosen,  the  heuristic  rule  pictured  above  (at  the  beginning  of 
this  subsection)  will  trigger  and  will  be  run  again,  with  X^Primes  and  Y»Numbers.  That  is, 
AM  will  be  considering  whether  (almost)  all  numbers  are  prime.?.  The  left-hand-side  of  the 
heuristic  rule  will  quickly  fail,  since,  e.g.,  "6"  is  an  example  of  Numbers  which  does  not 
satisfy  the  definition  of  Primes. 


4.4.4.  Another  illustration;  Discovering  Unique  Factorization 

Below  is  a  heuristic  rule  which  is  a  key  agent  in  the  process  of  "noticing"  the  fundamental 
theorem  of  arithmetic^^.  (The  reader  may  skip  this  subsection;  it  contains  more  details 
about  how  AM  actually  proposed  conjectures). 

If  P(a)  it  unexpectedly  •  B, 

Then  maybe  (Vx)  F(x)  is  a  B. 

Below,  the  same  rule  is  given  in  more  detail.  The  first  conjunct  on  the  IF-pari  of  the 
heuristic  rule  indicates  that  it’s  relevant  to  checking  examples  of  any  given  operation  F.  A 
typical  example  is  selected  at  random,  say  F(x)«=y.  Then  y  is  examined,  to  see  if  it  satisfies 
any  more  stringent  properties  than  those  specified  in  the  Domain/range  facet  of  F.  That  is, 
the  Domain/range  facet  of  F  contains  an  entry  of  the  form  A->B;  so  if  x  is  an  A,  then  all  we 
are  guaranteed  about  y  is  that  it  is  an  example  of  a  B.  But  now,  this  heuristic  is  asking  if  y 
isn’t  really  an  example  of  a  much  more  specializeo  concept  than  B.  If  it  is  (say  it’s  an 
example  of  a  BB),  then  the  rest  of  the  examples  of  F  are  examined  to  see  if  they  too  satisfy 
this  same  property.  If  all  examples  appear  to  map  from  domain  set  A  into  range  set  BB 
(where  BB  is  much  more  restricted  than  the  set  B  specified  originally  in  the  Domain/range 
facet  of  F),  then  a  new  conjecture  is  made:  the  domain/range  of  F  is  really  A-+BB,  not  A-»B. 
Here  is  that  rule,  in  crisper  notation: 
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If  th«  current  task  is  to  ChacK  Examples  of  th*  oparation  F, 
and  F  is  an  operation  from  domain  A  into  rans*  B, 
and  F  has  at  least  10  examples, 

and  a  typical  one  of  these  examples  is  "<x-»y>"  (so  xfA  and  y<B), 

and  (Forsome  Specialization  BB  of  B),  y  is  a  BB. 

and  all  examples  of  F  (ignoring  boundary  cases)  turn  out  to  be  BB’s, 

Then  print  the  following  conjecture:  “F(a)  is  aiways  a  BB,  not  simply  a  B", 
and  add  it  to  the  Examples  facet  of  Conjectures  concept, 

and  add  "<A  -»  BB)"  as  a  new  entry  to  the  Domain/range  facet  of  F,  replacing 
"<A-»B>", 

and  if  the  user  asks,  inform  him  that  the  evidence  for  this  was  that  all  ||Examples(F}|| 
examples  of  F  (ignoring  boundary  examples)  turned  out  te  be  BB’s, 
and  check  the  truth  of  this  conjecture  by  running  F  on  boundary  examples  of  A. 

Let’s  see  how  this  rule  was  used  in  one  instance.  In  Task  79  in  the  sample  excerpt  in 
Chapter  2,  AM  defined  the  concept  Prime-times,  which  was  a  function  transforming  any 
number  n  into  the  set  of  all  factorizations  of  n  into  primes.  For  example,  Prime- 
times(12)-{(2  2  3)},  Prime-times(13)-{(13)}.  The  domain  of  F«Prime-times  was  the  concept 
Numbers.  The  range  was  Sets.  More  piecisely,  the  range  was  Sets-of-Bags-of -Numbers,  but 
AM  didn’t  know  that  concept  at  that  time. 

The  above  heuristic  rule  was  applicable.  F  was  Prime-times,  A  was  Numbers,  and  B  was 
Sets.  There  were  far  more  than  10  known  examples  of  Prime-times  in  action.  A  typical 
example  was  this  one:  <21  -♦  {(3,7)}>.  The  rule  now  asked  that  {(3,7)}  be  fed  to  each 
specialization  of  Sets,  to  see  if  it  satisfied  any  of  their  definitions.  The  Specializations  facet 
of  Sets  was  acccessed,  and  each  concept  pointed  to  was  run  (its  definition  was  run)  on  the 
argument  "{(3,7)}".  It  turned  out  that  Singleton  and  Set-of-doubletons  were  the  only  two 
specializations  of  Sets  satisfied  by  this  example.  At  this  moment,  AM  had  narrowed  down 
the  potential  conjectures  to  these  two; 

1 .  Prime-times(x)  is  always  a  singleton  set. 

2.  Prime-times(x)  is  always  a  set  of  doubletons. 

Each  example  of  Prime-times  was  examined,  until  one  was  found  to  refute  each  conjecture 
(for  example,  <8-*{(2,2,2)}>  destroys  conjecture  2).  But  no  example  was  able  to  disprove 
conjecture  1.  So  the  heuristic  rule  plunged  forward,  and  printed  out  to  the  user  "A  new 
conjecture:  Prime-times(n)  is  always  a  singleton -set,  not  simply  a  set".  The  entry 
<Numbers-+Singleton-sets>  was  added  to  the  Domain/range  facet  of  Prime-times,  replacing 
the  old  entry  <Numbers-+Sets>. 

Let’s  digress  for  a  moment  to  discuss  the  robustness  of  the  system.  What  if  this  heuristic 
were  to  be  excised?  Could  AM  still  propose  unique  factorization?  The  answer  is  yes,  there 
are  other  ways  to  notice  it.  If  AM  has  the  concept  of  a  Function^®,  then  a  heuristic  rule  like 
the  one  in  the  previous  subsection  (page  50)  will  cause  AM  to  ask  if  Prime-times  is  not 
merely  a  relation,  but  also  a  Function. 
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The  past  few  sections  should  have  convinced  the  reader  that  isolated  heuristic  rules  really 
can  do  all  kinds  of  things:  propose  new  tasks,  create  new  concepts,  fill  in  entries  for  specific 
facets  (goal-driven),  and  look  for  conjectures  (data-driven  empirical  induction).  The  rules 
appear  fairly  general^'’  -  though  that  must  be  later  verified  empirically.  They  are 
redundant  in  a  pleasing  way:  some  of  the  most  "important",  well-known,  and  interesting 
conjectures  can  (apparently)  be  derived  in  many  ways.  Again,  we'll  have  to  check  this 
experimentally. 


4.5.  Gathering  Relevant  Heuristics 

Each  concept  has  facets  which  contain  some  heuristics.  Some  of  these  are  for  filling  in, 
some  for  checking,  some  for  deciding  interestingness^^  some  for  noticing  new  conjectures, 
etc. 

AM  contains  hundreds  of  these  heuristics.  In  order  to  save  time  (and  to  make  AM  appear 
more  rational),  each  heuristic  should  only  be  tried  in  situations  where  it  might  apply,  where 
it  makes  sense. 

How  is  AM  able  to  zero  in  on  the  relevant  heuristic  rules,  once  a  task  has  been  selected 
from  the  agenda? 


4.5.1.  Domain  of  Applicability 

The  secret  is  that  each  heuristic  rule  is  sto-ed  somewhere  a  propos  to  Its  "domain  of 
applicability".  This  "proper  place"  is  determined  by  the  first  conjunct  in  the  left-hand  side 
of  the  rule. 

What  does  this  mean?  Consider  this  heuristic: 

If  the  current  task  is  to  fill  in  examples  of  the  operation  F,  <■■ . 

and  some  examples  of  the  domain  of  F  are  known, 

Then  one  way  to  get  examples  of  F  is  to  run  F  on  randomly  chosen  examples  of  the  domain 
of  F. 

This  is  a  reasonable  thing  to  try  -  but  only  in  certain  situations.  Should  it  be  tried  when 
the  current  task  is  to  check  the  Worth  facet  of  the  Sets  concept?  No,  it  would  be  irrational. 
Of  course,  even  if  it  were  tried  then,  the  left-hand-side  would  fail  very  quickly.  Yet  some 
epu  time  would  have  been  used,  and  if  the  user  were  watching,  his  opinion  of  AM  would 
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{«,  eppircable  m  miny  situetione  It  would  be  wore#  than  utelea*  if  a  rule  exiated  which  could  lead  to  a  aingle  discovery 
like  “Fibonacci  senes"  but  never  lead  to  any  other  discoveries.  The  reason*  for  demandiiig  generality  are 
not  o.ily  "fairness",  but  the  nsights  which  occur  when  it  is  observed  that  several  disparate  concept*  were 
all  motivated  by  the  same  general  principle  (eg,  "looking  for  the  inverse  image  of  extrema") 

25 

The  reader  ha*  already  seen  several  heuristics  useful  for  filling  in  and  checkmg  facets:  here  i*  on*  for  judging 
interestingness'  an  entry  on  the  Interest  facet  of  Compose  say*  that  a  composition  AoB  i*  more  interesting 
if  the  range  of  6  equals  the  domain  of  A„  than  if  if  they  only  partially  overlap. 


Ch«pttr  4 


AM:  Oiicovtry  in  M«th«mitfct  *«  H«urit(ic  Search 


-55- 


dccrease.^® 

That  particular  heuristic  has  a  precise  domain  of  applicability;  AM  should  use  it  whenever 
the  current  task  is  to  fill  in  examples  of  an  operation,  and  only  in  those  kinds  of  situations. 

The  key  observation  is  that  a  heuristic  typically  applies  to  all  examples  of  a  particular 
concept  C.  In  ihe  case  we  were  considering,  C=Operation.  Intuitively,  we’d  like  to  tack  that 
heuristic  onto  the  Examples  facet  of  the  concept  Operation,  so  it  would  only  "come  to  mind" 
in  appropriate  situations.  This  is  in  fact  precisely  where  the  heuristic  rule  is  stored. 

Initially,  the  author  identified  the  proper  concept  C  and  facet  F  for  each  heuristic  H  which 
AM  possessed,  and  tacked  H  onto  C.F^^.  This  was  all  preparation,  completed  long  before 
AM  started  up.  Each  heuristic  was  tacked  onto  the  facet  which  uniquely  indicates  its 
domain  of  applicability.  The  first  conjunct  of  the  IF-part  of  each  heuristic  indicates  where 
it  is  stored  and  where  it  is  applicable.  Notice  the  little  arrow  (<==)  pointing  to  that  conjunct 

above.^*. 

While  AM  is  running,  it  will  choose  a  task  dealing  with,  say,  facet  F  of  concept  C.  AM 
must  quickly  locate  the  heuristic  rules  which  are  relevant  to  satisfying  that  chosen  task. 
AM  simply  locates  all  concepts  which  claim  C  as  an  example.  If  the  current  task  were 
"Check  the  Domain/rang#  of  UnionoUnion"^®,  then  C  would  be  UnionoUnion.  Which  concepts 
claim  C  as  an  example?  They  include  Compose-with-Self,  Composition,  Operation,  Active, 
Any-concept,  and  Anything.  AM  then  collects  the  heuristics  tacked  onto  facet  F  (in  this 
case,  F  is  Domain/range)  of  each  of  those  concepts.  All  such  heuristics  will  be  relevant.  In 
the  current  case,  some  relevant  heuristics  might  be  garnered  from  the  Domain/range  facet  of 
the  concept  Operation.  Any  heuristic  which  can  deal  with  the  Domain/range  facet  of  any 
operation  can  certainly  deal  with  UnionoUnion’s  Domain/range.  A  typical  rule  on 
Operation.Domain/range.Check^®  would  be  this  one: 

If  •  Dom/ran  entry  of  F  is  of  the  form  <D  D  D...D  -*  R>,  where  R  is  a  generalization  of 

0, 

Then  test  whether  the  range  might  not  be  simpiy  D. 

Suppose  one  entry  on  UnionoUnion.Dom/ran  was  '<Nonempty-sets  Nonempty-sets 
Nonempty-sets  -+  Sets>’.  Then  this  last  heuristic  rule  would  be  relevant,  and  would  have 
AM  ask  the  plausible  question:  Is  the  union  of  three  nonempty  sets  always  nonempty?  The 
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Tht«  notion  of  worrying  *bout  i  humtn  uttr  vho  it  obttrving  AM  run  in  rial  timt  may  apptar  to  bt  quit*  languago-  and 
machmt-dtptndtnl  An  inertatt  in  tpeed  of  a  coupit  ordtrt  of  magniludt  would  radically  tltar  lh« 
qualilativt  appearance  of  AM.  In  Chapter  7,  however,  the  reader  will  gratp  how  difficult  it  it  to 
objectivtiy  rate  a  tytlem  like  AM  For  that  rtaton,  all  meaturet  of  judgment  mutt  be  retpected.  Alto,  to 
the  actual  human  being  uting  the  tyttem  thit  really  it  one  of  the  matt  important  meaturet. 

Recall  that  C.F  it  an  abbreviation  for  facet  F  of  concept  C 

In  the  LISP  implementation,  theie  firtt  conjunett  are  omitted,  tinea  the  placement  of  a  heurittic  tervat  the  tame 
purpote  at  if  it  had  tome  ’pre-preeonditiont*  (like  theta  firtt  conjunett)  to  determine  relevance  quickly. 

Thit  operation  it  defined  at  UnionoUnion(x,y,z)  •  (x  U  y)  U  x.  It  acceptt  3  tett  tt  argumentt,  end  returnt  a  new 

tat  at  Ht  value. 


^  the  *Check’  tiAifKet  of  the 


'Domtin/range’  facet  of  the  'Operation'  concept. 
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answer  is  affirmative,  empirically,  so  AM  modifies  that  Domain/range  entry  for 
UnionoUnion.  AM  would  ask  the  same  question  for  Intersectointersect.  Although  the 
answer  then  would  be  'No',  it's  still  a  rational  inquiry.  If  AM  called  on  this  heuristic  rule 
when  the  current  task  was  "Fillin  spieialiaations  of  Bags",  it  would  clearly  be  an  irrational  act. 
The  domain  of  applicability  of  the  rule  is  clear,  and  is  precisely  fitted  to  the  slot  where  the 
rule  is  stored  (tack^  onto  Operation.Domain/range). 

To  recap  the  basic  Idea:  when  dealing  with  a  task  "Do  act  A  on  facet  F  of  concept  C",  AM 
must  locate  aii  the  concepts  X  claiming  C  as  an  example.  AM  then  gathers  the  heuristics 
tacked  onto  X.F.A,  for  each  such  general  concept  X.  All  of  them  -  and  only  they  -  are 
relevant  to  satisfying  that  task. 

So  the  whole  problem  of  locating  relevant  heuristics  has  been  reduced  to  the  problem  of 
efficiently  finding  all  concepts  of  which  C  is  an  example  (for  a  given  concept  C).  This 
process  is  called  "rippling  away  from  C  in  the  ISA  direction",  and  forms  the  subject  of  the 
next  subiection. 


4.5.2.  Rippling 

Given  a  concept  C,  how  can  AM  find  all  the  concepts  which  claim  C  as  an  example? 

The  most  obvious  scheme  is  to  store  this  information  explicitly.  So  the  Examples  fac»»t  of  C 
would  point  to  ail  known  examples  of  C,  and  the  Isa  facet  of  C  would  point  to  all  known 
concepts  claiming  C  as  one  of  their  examples.  Why  not  just  do  this?  Because  one  can 
substitute  a  modest  amount  of  processing  time  (via  chasing  links  around)  for  the  vast 
amount  of  storage  space  that  would  be  needed  to  have  "everything  point  to  everything". 

Each  facet  contains  only  enough  pointers  so  that  the  entire  graph  of  Exs/Isa  and  Spec/CenI 
links  could  be  reconstructed  if  needed.  Since  "Genl"^'  is  a  transitive  relation,  AM  can 
compute  that  Numbers  is  a  generalization  of  Mersenne-primes,  if  the  facet  Mersenne- 
primes.Genl  contains  the  entry  "Odd-primes",  and  Odd-primes.Genl  contains  a  pointer  to 
"Primes",  and  Primes.GenI  points  to  "Numbers".  This  kind  of  "rippling”  activity  is  used  to 
efficiently  locate  all  concepts  related  to  a  given  one  X.  In  particular,  AM  knows  how  to 
"ripple  upward  in  the  Isa  direction",  and  quickly^^  locate  all  concepts  which  claim  X  as  one 
of  their  examples. 

It  turns  out  that  AM  cannot  simply  call  for  X.Isa,  then  the  Isa  facets  of  those  concepts,  etc., 


31 

'Gtnl'  i*  in  (bbrtvitlion  for  th«  Gtntraliiiliont  facet  of  a  concept,  aimilirl/,  "Spec"  meant  Specializationt,  Ext  meant 
Examplet,  etc.  ’Ita”  it  the  eonvarta  facet  to  Ext;  ie,  A  <  BExt  iff  B  t  Alta.  Saying  "GenI  it  trantitive"  jutt 
meant  the  following'  if  A  it  a  generalization  of  B,  and  B  of  C,  then  A  it  alto  a  generalization  of  C. 

32 

With  about  200  known  concaptt,  with  each  lea  facet  and  aKh  GenI  facet  pointing  to  about  3  other  concepta,  about  25 
linkt  will  be  traced  along  in  order  to  locate  about  a  dozer,  fmal  conceptt,  aKh  of  which  claimt  the  given  one 
at  an  example  Thia  whole  rippling  procata,  trKi-.,  25  linkagat,  utat  lett  than  01  epu  aecondt,  in 
compiled  Inlarliap,  on  a  KUO  type  POP-10. 
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because  Isa  is  not  transitive®^.  For  the  interested  reader,  the  algorithm  AM  uses  to  collect 
Isa’s  of  X  is  given  below.®’* 

1.  All  generalizations  of  the  given  concept  X  are  located.  AM  accesses  X.GenI,  then 

the  Genl  facets  of  those  concepts,  etc. 

2.  The  "Isa"  facet  of  each  of  those  concepts  is  accessed. 

3.  AM  locates  all  generalizations  of  these  newly-found  higher-level  concepts.  This  is 

the  list  of  all  known  concepts  which  claim  X  as  one  of  their  examples. 

In  regular  form,  one  might  express  this  rippling  recipe  more  compactly  as: 
G«nl*(l«a(G;«nl*(X))).  There  is  not  much  need  for  a  detailed  understanding  of  this  process, 
hence  it  will  not  be  delved  into  further  in  this  thesis.  This  section  probably  already 
contains  more  than  anyone  would  want  to  know  about  rippling.®^ 


4.5.3.  Ordering  the  Relevant  Heuristics 


Now  that  all  these  relevant  heuristics  have  been  assembled,  in  what  order  should  A.M 
execute  them?®®  It  is  important  to  note  that  the  heuristics  tacked  onto  very  general  concepts 
will  be  applicable  frequently,  yet  will  not  be  very  powerful.  For  example,  here  is  a  typical 
heuristic  rule  which  is  tack^  onto  the  Examples  facet  of  the  very  general  concept  Any- 
concept: 


If  the  current  task  is  to  fill  in  examples  of  any  concept  X, 

Then  one  way  to  get  them  is  to  symbolically  instantiate®®  a  definition  of  X. 


It  takes  a  tremendous  amount  of  inference  to  squeeze  a  couple  awkward  examples  of 
Intersectolntersect  out  that  concept’s  definition.  Much  time  could  be  wasted  doing  so®^ 


33 

K  X  i»«  y,  and  y  wi  z,  than  x  i«  (gtnarilly)  NOT  •  z.  Thia  ia  Am  to  tha  intranaitivity  of  "mambar-of".  Ganaralization  ia 
tranaitiva,  on  tha  othar  hand,  bacauta  ”atAaa!-of'  ia  tranaitiva. 

®^  For  tha  vary  intaraatad  raadar,  it  ia  axplainad  in  graat  datail  in  fila  RIPPLE[dia,dbl]  at  SAIL  Thia  fila  haa  baan 
parmanantly  archivad  at  SAIL. 

35 

Tha  diactraaion  balow  aaaumaa  that  tha  hauriatica  don't  intaract  with  aach  othar;  i.a,  that  aach  ona  may  act  indapendantly 
of  all  othara.  Tha  validity  of  thia  amplification  ia  taalad  empirically  (aaa  Chapter  6)  and  diacuaaad 
thaoratka'ly  (aaa  Chapter  7)  later. 

3fi 

"Symbolic  inatantiation''  ia  a  auphamiam  for  a  bag  of  trkka  which  tranaform  a  daclarativa  definition  of  a  concapt  into 
particular  antitiea  aatiafying  that  definition.  Tha  only  conatraint  on  tha  tricka  ia  that  they  not  actually  riM 
tha  definition  Ona  auch  trick  might  be:  if  tha  definition  la  racuraiva,  maraly  find  aoma  entity  that  aatiafiaa 
tha  baaa  atap.  AM'a  aymbolic  inatantiation  Irkka  are  too  hand-craftad  to  be  of  graat  intaraat,  hanca  thia  will 
not  ba  covarad  any  more  deeply  here.  Tha  intaraatad  raadar  ia  directed  to  tha  pionaering  work  by 
[Lombardi  A  Raphael  64],  or  tha  more  recant  ktaratura  on  thaaa  tachniquaa  applied  to  automatic  program 
verification  (a.g.,  [Moore  75]). 

37 

Incidantally,  thia  Hluatrataa  why  no  aingla  hauriat'ie  ahould  ba  altowad  to  monopolize  tha  procaaaing  of  any  one  taak. 
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Just  as  general  heuristics  are  weak  but  often  relevant,  specific  heuristics  are  powerful  but 
rarely  relevant.  Consider  this  heuristic  rule,  which  is  attached  to  the  very  specific  concept 
Compose-with-Self: 

If  <h«  currant  task  is  to  fill  in  axampias  of  tha  composition  FoF, 

Than  includa  any  fixad-points  of  F. 

For  example,  since  Intersect(pAt,X)  equals  phi,  so  must  Intersectolntersect(/>At,X,Y).^®. 
Assuming  that  such  examples  exist  already  on  Intersect,  this  heuristic  will  fill  in  a  few 
examples  of  Intersectolntersect  with  essentially  no  processing  required.  Of  course  the 
domain  of  applicability  of  this  heuristic  is  minuscule. 

As  we  expected,  the  narrower  its  domain  of  applicability,  the  more  powerful  and  efficient  a 
heuristic  is,  and  the  less  frequently  it's  useful.  Thus  in  any  given  situation,  where  AM  has 
gathered  many  heuristic  rules,  it  will  probably  be  best  to  execute  the  most  specific  ones  first, 
and  execute  the  most  general  ones  last. 

Below  are  summarized  the  three  main  points  that  make  up  AM’s  scheme  for  finding 
relevant  heuristics  in  a  "natural"  way  and  then  using  them: 

1.  Each  heuristic  Is  tacked  onto  the  most  general  concept  for  which  it  applies:  it  is 
given  as  large  a  domain  of  applicability  as  possible.  This  will  maximize  its 
generality,  but  leave  its  power  untouched.  This  brings  it  closer  to  the  "ideal" 
tradeoff  point  between  these  two  quantities. 

2.  When  the  current  task  deals  with  concept  C,  AM  ripples  away  from  C  and  quickly 

locates  all  the  concepts  of  which  C  Is  an  example.  Each  of  them  will  contain 
heuristics  relevant  to  dealing  with  C. 

3.  AM  then  applies  those  heuristics  In  order  of  increasing  generality.  You  may  wonder 

how  AM  orders  the  heuristics  by  generality.  It  turns  out  that  the  rippling  process 
automatically  gathers  heuristics  in  order  of  increasing  generality.  In  the  LISP 
system,  each  rule  is  therefore  executed  as  soon  as  it’s  found.  So  AM  nevers  wastes 
time  gathering  heuristics  it  won’t  have  time  to  execute. 


This  section  will  briefly  characterize  the  collection  of  242  heuristic  rules  which  AM  was 
originally  given.  A  complete  listing  of  those  rules  is  found  in  Appendix  3;  the  riile 
numbers  below  refer  to  the  numbering  given  in  that  appendix. 


phi  i*  tnaOnc  ntmt  for  th«  amply  tot,  ilto  wrrttan  {)  This  hot  oonlanct  Ihut  tiyt  Ihtl  tinct  {}  H  X  •  {},  than  <{}  n 
X)  n  y  mutt  tlto  aqutl  {} 
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4.6.1.  Heuristics  Grouped  by  the  Knowledge  They  Embody 

Many  heuristics  embody  the  belief  that  mathematics  is  an  empirical  Inquiry.  That  is,  one 
approach  to  discovery  is  simply  to  perform  experiments,  observe  the  results,  thereby  gather 
statistically  significant  amounts  of  data.  Induce  from  that  data  some  new  conjectures  or  new 
concepts  worth  Isolating,  and  then  repeat  this  whole  process  again.  Some  of  the  rules  which 
capture  this  spirit  are  numbers  21,  43-57,  91,  136-139,  146-148,  153-154,  212-216,  225, 
and  241.  As  one  might  expect,  most  of  these  are  "Suggest"  type  rules.  They  indicate 
plausible  moves  for  AM  to  make,  promising  new  tasks  to  try,  new  concepts  worth  studying. 
Almost  all  the  rest  are  "Fillln"  type  rules,  providing  empirical  methods  to  find  entries  for  a 
specified  facet. 

Another  large  set  of  heuristics  Is  used  to  embody  -  or  to  modify  -  what  can  be  called 
"focus  of  attention".  When  should  AM  keep  on  the  same  track,  and  when  not?  The  first 
rules  expressing  varying  nuances  of  this  idea  are  numbers  1-5.  The  last  such  rules  are 
numbers  209-216.  Some  of  these  rules  are  akin  to  goal-setting  mechanisms  (e.g.,  rule  141). 
In  addition,  many  of  the  "Interest"  type  rules  have  some  relation  to  keeping  AM  interested 
in  recently-chosen  concepts  (or:  In  concepts  related  to  them,  e.g.  by  Analogy,  by  Genl/Spec, 
by  Isa/Exs,...). 

The  remaining  "Interest"  rules  are  generally  some  re-echoing  of  the  following  notion:  X  is 
Interesting  if  F(X)  has  an  unexpected  (interesting)  value.  For  example,  in  rule  26,  "F(X)" 
is  just  "Generalizations  of  X".  In  slightly  more  detail,  the  principle  characteristics  of 
interestingness  are: 

•  symmetry  (e.g.,  in  an  expanding  analogy) 

•  coincidence  (e.g.,  in  a  concept  being  re-discovered  often) 

•  appropriateness  (e.g.,  in  choosing  an  operation  H  so  that  GeH  will  have  nicer 

Domain/Range  characteristics  than  G  itself  did) 

•  recency  (see  the  previous  paragraph  on  focus  of  attention) 

•  individuality  (e.g.,  the  first  entity  observed  which  satisfies  some  property) 

'  usefulness  (e.g.,  there  are  many  conjectures  involving  it) 

•  association  (i.e.,  the  given  concept  is  related  to  an  interesting  one) 


One  group  of  heuristic  rules  embeds  syntactic  tricks  for  generalizing  definitions  (Lisp 
predicates),  specializing  them,  instantiating  them,  symbolically  evaluating  them.  Inverting 
them,  rudimcntarily  analyzing  them,  etc.  For  example,  see  rules  31  and  89.  Some  rules 
serve  other  syntactic  functions,  like  ensuring  that  various  limits  aren’t  exceeded  (e.g.,  rule 
15),  that  the  format  for  each  facet  is  adhered  to  (e.g.,  rule  16),  that  the  entries  on  each 
facet  are  used  as  they  are  meant  to  be  (e.g.,  rules  9  and  59),  etc.  Many  of  the  "Check" 
type  heuristics  fall  into  this  category. 

Finally,  AM  possesses  a  mass  of  miscellaneous  rules  which  evade  categorization.  See,  e.g., 
rules  185  and  236.  These  range  from  genuine  math  heuristics  (rules  which  lead  to  discovery 
frequently)  to  simple  data  management  hacks. 

No  detailed  analysis  has  been  performed  on  the  set  of  heuristics  AM  possesses,  as  of  the 
time  of  writing  of  this  thesis. 


A 
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4.6.2.  Heuristics  Grouped  by  How  Specific  They  Are 

Another  dimension  of  distribution  of  heuristics,  aside  from  the  above  functional  one,  is 
simply  that  of  how  high  up  in  the  Genl/Spec  tree  they  are  located.  The  table  belov 
summarizes  how  the  rules  were  distributed  in  that  tree: 


LEVEL 

IL 

•  Con’* 

•  w/H«ur 

•  H«urt 

Avc  w/Htur  •  Fillin 

■Suee 

•  Chtck 

«  Int 

0 

Anything 

1 

1 

10 

10.0 

10.0 

0 

5 

0 

5 

1 

Any-Concept 

1 

1 

no 

110.0 

110.0 

39 

30 

20 

21 

2 

Active 

2 

2 

24 

12.0 

12.0 

7 

10 

4 

3 

3 

Operation 

6 

3 

31 

5.2 

10.3 

11 

3 

3 

14 

^4 

Union 

100 

11 

63 

0.6 

5.7 

26 

15 

8 

16 

Here  is  a  key  to  the  column  headings: 

LEVEL:  How  far  down  the  Genl/Spec  tree  of  concepts  we  are  looking, 
e.g.:  A  sample  concept  at  that  level. 

•  Con’s:  The  total  number  of  concepts  at  that  level. 

•  w/Heur:  How  many  of  them  have  some  heuristics. 

•  Heurs:  The  total  number  of  heuristics  attached  to  concepts  at  that  level. 

Avg:  (•  Heurs)  /  (*  Concepts);  i.e.,  the  mean  number  of  heuristics  per  concept,  at  that 
level. 

Avg  w/Heur:  (•  Heurs)  /  (•  w.  Heurs) 
t  Fillin:  Total  number  of  "Fillin"  type  heuristics  at  that  level. 

•  Sugg:  Total  number  of  "Suggest"  type  heuristics  at  that  level. 

•  Check:  Total  number  of  "Check"  type  heuristics  at  that  level. 

•  Int:  Total  number  of  "Interestingness"  type  heuristics  at  that  level. 


The  heuristic  rules  are  seen  not  to  be  distributed  uniformly,  homogeneously  among  all  the 
initial  concepts.  The  extent  of  this  skewing  was  not  realized  by  the  author  until  the  above 
table  was  constructed.  A  surprising  proportion  of  rules  are  attached  to  the  very  general 
concepts.  The  top  lOX  of  the  concepts  contain  73/J  of  ail  the  heuristics.  One  notable 
exception  is  the  "Interest"  type  heuristics:  they  seem  more  evenly  distributed  throughout  the 
tree  of  initial  concepts.  This  tends  to  suggest  that  future  work  on  providing  "meta- 
heuristics"  should  concentrate  on  how  to  automatically  synthesize  those  Interest  heuristics  for 
newly-created  concepts. 


Chapter  5.  AM’s  Concepts 


This  chapter  contains  material  about  AM’s  anatomy.  After  a  brief  overview,  we’ll  look  in 
detail  at  the  way  concepts  are  represented  (Section  5.2).  This  includes  a  discussion  of  each 
kind  of  facet  a  concept  may  possess.  Wedged  in  among  the  implementation  details  and 
formats  are  a  horde  of  tiny  ideas;  they  should  be  useful  to  anyone  contemplating  working 
cn  a  system  similar  in  design  to  AM. 

The  chapter  closes  by  sketching  all  the  knowledge  AM  starts  with.  The  concepts  will  be 
diagrammed,  and  will  also  have  a  brief  description,  sufficient  for  the  reader  to  follow  later 
chapters  without  trouble.  Instead  of  using  up  a  large  number  of  pages  for  an  unreadable 
listing  of  ait  of  the  specific  information  initially  supplied  rt  each  concept,  such  complete 
coverage  is  relegated  to  Appendix  2.1.  ' 

The  next  chapter  starts  on  page  IH.® 


5.1.  Motivation  and  Overview 

Each  concept  consists  merely  of  a  bundle  of  facets.  The  facets  represent  the  different  aspects 
of  each  concept,  the  kinds  of  questions  one  might  want  to  ask  about  the  concept: 

How  valuable  is  this  concept? 

What  is  its  definition? 

If  it’s  an  operation,  what  is  legally  in  its  domain? 

What  are  some  generalizations  of  this  concept? 

How  can  you  separate  the  interesting  instances  of  this  concept  from  the  dull  ones? 
etc. 

Since  each  concept  is  a  mathemati  il  entity,  the  kinds  of  questions  one  might  ask  are  fairly 
constant  from  concept  to  concept.  This  set  of  questions  might  change  significantly  for  a  new 
domain  of  concept. 

One  "natural"  representation  for  a  concept  in  LISP  is  therefore  as  a  set  of  attribute/value 


Th*t  tpp«ndix  li«t»  ttch  eonctpt,  fivinf  t  condtnttd  of  th«  facto  initially  (ivtn  (by  th«  author)  to  AM  about  tach 
facot  of  that  eonctpt.  This  mattrial  i*  tranatatad  from  LISP  into  Enflith  and  atandard  math  notation.  Tha 
appandii  la  pracadad  by  an  alphabaticti  indax  of  tha  concapta  and  tha  paga  numbar  on  whKh  thay  ara 
praaantad  That  indax  ia  on  paft  173.  Soma  unmodifiad  'concapta"  ••  atill  in  LISP  —  ara  diaplay^  in 
Appandix  23 

2 

Though  davoid  of  thaoraticti  tignificanca,  that  aantanca  haa  alaa  provad  of  high  ampirieal  valua. 
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pairs.  That  is,  each  concept  is  maintained  as  an  atom  with  a  property  list.  The  names  of  the 
properties  (Worth,  Definitions,  Domain/Range,  Generalizations,  Interestingness,  etc.) 
correspond  to  the  questions  above,  and  the  value  stored  under  property  F  of  atom  C  is 
simply  the  value  of  the  F- facet  of  the  C-concept.  This  value  can  also  be  viewed  as  the 
answer  which  expert  C  would  give,  if  asked  question  F.  Or,  it  can  be  viewed  as  the  contents 
of  slot  F  of  frame  C. 


5.1.1.  A  Glimpse  of  a  Typical  Concept 

As  an  example,  here  is  a  stylized  rendition  of  the  SETS  concept.  This  is  a  concept  which  is 
meant  to  correspond  to  the  notion  of  a  set  of  elements.  The  format  P:  V|,V2,...  is  used  to 

indicate  th'^t  the  vaue  of  property  P  is  the  list  Vj,V2,...  That  is,  the  concept  Sets  has  entries 
V  j,V2,...  for  its  facet  P.  For  example,  according  to  the  box  below,  "Singleton"  is  one  entry  on 
the  Specializations  facet  of  Sets. 


I  shall  not  digress  here  to  explain  each  of  these  entries  -  and  what  are  apparently 
omissions.  Such  things  will  be  done  later  in  this  chapter^.  For  now,  just  glance  at  it  to  get 
the  flavor  of  what  a  concept  is  like. 
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Nam«(t):  Sat,  Clast,  Coltaciion 
Dafinitioni! 

Raeuriiva:  X  (S)  [$■{}  or  Sat.Dafinition  (Ramova(Anymambar(S),S))] 
Racurtiva  quick:  X  (S)  [$■{}  or  Sat.Dafinition  (CDR(S))] 

Quick!  X  (S)  [Match  S  with  {...}  ] 

Spacializations:  Emptysat,  Nonamptyiat,  Sat-of-structuras,  Singlaton 
Ganaralizations!  Unordarad-Structura,  No-multipla*alamants-Structura 
Examplas! 

Typical:  {{}},  {A},  {A,B},  {3} 

Baraly:  {},  {A,  B,  {C,  { { {  A,  C,  (3,3,3,9),  <4,1  ,A,{B},A>}}}}} 

Not-quita:  {A,A},  (),  {B,A} 

Foibla:  <4,1,A,1> 

Conjac’s!  All  unordarad-structuras  ara  tats. 

Intu’s: 

Gaomatric!  Vann  diafram.  {Saa  [Vann  89],  or  [Skamp  71].} 

Analogs:  bag,  list,  osat 
Worth:  600 
Viaw: 

Pradicata:  X  (P)  {x<Domain(P)  |  P(x}} 

Structura:  X  (S)  Enclosa-in'bracatfSorttRamova-multipla'alamanttfS))) 
Suggast:  if  P  it  an  intarasting  pradicata  ovar  X,  contidar  [x<X  |  P(x)}. 
In-domain'Of:  Union,  Intarsaction,  Sat-diffaranca,  Sat*aquality,  Subiat,  Mambar 
In-ranga-of:  Union,  Intarsaction,  Sat*difforanca,  Satisfying 


To  decipher  the  Definitions  facet,  there  are  a  few  things  you  must  know.  An  expression  of 
the  form  "(X  (x)  E)"  is  called  a  Lambda  expression  after  Church^  and  may  be  considered 
an  executable  procedure,  it  accepts  one  argument,  binds  the  variable  "x"  to  the  value  of 
that  argument,  and  then  evaluates  "E"  (which  is  probably  some  expression  involving  the 
variable  x).  For  example,  "(X  (x)  (x+5))"  is  a  function  which  adds  5  to  any  number;  if  given 
the  argument  3,  this  lambda  expression  will  return  the  value  8. 

The  second  thing  you  must  know  is  that  facet  F  of  concept  C  will  occasionally  be 
abbreviated  as  C.F.  In  those  cases  where  F  is  "executable",  the  notation  C.F  will  refer  to 
applying  the  corresponding  function.  So  the  first  entry  in  the  Definitions  facet  is  recursive 
because  It  contains  an  embedded  call  on  the  function  Set.Definition.  Notice  that  we  are 
implying  that  the  name  of  that  lambda  expression  itself  i.s  "Set.Definition". 

There  are  some  bizarre  implications  of  this:  since  there  are  three  separate  but  equivalent 
definitions,  AM  may  choose  whichever  one  it  wants  when  It  recurs.  AM  can  choose  one  via 
a  random  selection  scheme,  or  always  try  to  recur  into  the  same  definition  as  it  was  just  in, 
or  perhaps  suit  its  choice  to  the  form  of  the  argument  at  the  moment. 

For  example,  one  definition  might  be  great  for  arguments  of  size  10  or  less,  but  slow  for 
bigger  ones,  and  another  definition  might  be  mediocre  for  all  size  arguments;  then  AM 


^  Safer*  and  durinf  CKureh,  H’t  cilhd  •  function.  $••  (CKurch  4 1 ). 
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should  use  the  mediocre  definition  over  and  over  again,  until  the  argument  becomes  small 
enough,  and  from  then  on  recur  only  into  the  fast  definition.  Although  AM  embodies  this 
"smart"  scheme,  the  little  comments  necessary  to  see  how  it  does  so  have  be  excised  from  the 
version  shown  above  in  the  box.  This  will  be  explained  later  in  this  chapter,  on  page  90. 

All  concepts  possess  executable  definitions,  though  not  necessarily  effective  ones.  They  each 
have  a  LISP  predicate,  but  that  predicate  is  not  guaranteed  to  terminate.  Notice  that  the 
definitions  for  Sets  are  all  definitions  of  finite  sets.^ 


5.1.2.  The  main  constraint:  Fixed  set  of  facets 

One  important  constraint  on  the  representation  is  that  the  set  of  facets  be  fixed  for  all  the 
concepts.  An  additional  constraint  is  that  this  set  of  facets  not  grow,  that  it  be  fixed  once 
and  for  ail.  So  there  is  one  fixed,  universal  list  of  two  dozen  types  of  facets.  Any  facet  of 
any  concept  must  have  one  of  those  standard  names.  All  concepts  which  have  some 
examples  must  store  them  as  entries  on  a  facet  called  Examples;  they  can’t  call  them 
Instances,  or  Cases,  or  G00037’s.  This  constraint  is  known  as  the  "Beings  constraint"^  and 
has  three  important  consequences: 

1.  OUTLINE:  First,  it  provides  a  nice,  distributed,  universal  framework  on  which  to 

display  all  that  is  known  about  a  given  concept.  For  example,  when  AM  creates  a 
new  concept  like  "Square-root",  the  user  can  judge  how  well  AM  understands  that 
concept  by  examining  Square-root’s  property-list  (the  list  of  entries  for  each  of  its 
facets).  Similarly,  AM  can  instantly  tell  what  facets  are  not  yet  filled  in  for  any 
given  concept,  and  this  will  in  turn  suggest  new  tasks  to  perform.  In  other  words, 
this  constraint  helps  define  the  "space"  which  AM  must  explore,  and  makes  it 
obvious  what  parts  of  each  concept  have  and  have  not  yet  been  investigated. 

2.  STRUCTURE:  The  constraint  specifies  that  there  be  a  set  of  facets,  not  just  one. 

This  set  was  made  large  enough  that  all  the  efficiency  advantages  of  a  "structured" 
representation  are  preserved  (unlike  totally  uniform  representations,  e.g.  pure 
production  systems  with  simple  memories  as  data  structures,  or  predicate  calculus). 

3.  UNIFORMITY:  The  most  Important  benefit  of  the  Beings  constraint  arises  when 
AM^  wants  to  get  a  particular  question  answered  -  especially  if  the  Information 
pertains  to  related  concepts.  The  advantage  is  that  it’ll  have  a  very  limited 
repertoire  of  questions  it  may  ask,  hence  there  will  be  no  long  searching,  no 
misunderstandings.  This  is  the  same  advantage  that  always  arises  when  everyone 
uses  a  common  language. 

We  shall  illustrate  the  last  two  advantages  by  using  the  Sets  concept  pictured  in  the  box  a 
couple  pages  ago.  How  does  AM  handle  a  task  of  this  form:  "Check  examples  of  Sets"?  AM 
accesses  the  examples  facet  of  the  Sets  concept,  and  obtains  a  bunch  of  items  which  are  all 


^  TS«  third  dafinitien,  ”{  may  not  look  tiftitt,  but  contidtr  thit  •llipti*  notation  it  not  ptrmitttd  within  tny  tpteific  ttt. 

^  S««  [Ltntt  75b]  Hittorktlly,  aich  conetpt  modult  wtt  ctiltd  t  ’BEING”. 

^  Actually,  th«  raqutttor  it  not  ’AM’  in  toto,  but  rathtr  timply  a  clautt  which  it  a  part  of  a  hturutk  rule,  or  a  bit  of  cod* 
tmbtddtd  within  an  tntry  on  an  txteutabit  feett,  tuch  at  Alforithmt 


Ch«pt«r  5 


AM:  Oitcovcry  in  Mathcnutict  «t  Hcurwtic  Stirch 


-65- 


probabty  sets.  If  any  isn’t  a  set,  AM  would  tike  to  make  it  one,  if  that  involves  nothing 
diflRcult.  AM  locates  all  the  generalizations  of  Sets*  and  comes  up  with  the  list  <Sets, 
Unordered-StVuctures,  No-multiple-elements-Structures,  Structures,  Objects,  Any-concept, 
Anything>.  Next,  the  "Check"  facet  of  each  of  these  is  examined,  and  all  its  heuristics  are 
collected.  For  example,  the  Check  facet  of  the  No-multiple-elements-Structures  concept 
contains  the  following  entry:  "Eliminate  multiple  occurrences  of  each  element"  (of  course  this 
is  present  not  as  an  English  sentence  but  rather  as  a  little  LISP  function).  So  even  though 
Sets  has  no  entries  for  its  Check  facet,  several  little  functions  will  be  gathered  up  by  the 
rippling  process.  Each  potential  set  would  be  subjected  to  all  those  checks,  and  might  be 
modified  or  discarded  as  a  result. 

There  is  enough  "structure"  around  to  keep  the  heuristic  rules  relevant  to  this  task  isolated 
from  very  irrelevant  rules,  and  there  is  enough  "uniformity"  to  make  finding  those  rules 
very  easy. 

The  same  rippling  would  be  done  to  find  predicates  which  tell  whether  a  set  is  interesting 
or  dull.  For  example,  one  entry  on  the  Interestingness  facet  of  the  Structure  concept  says 
that  a  structure  is  interesting  if  alt  pairs  of  members  satisfy  the  same  rare  predicate  P(x,y) 
[for  any  such  P].  So  a  set,  all  pairs  of  whose  members  satisfy  "Equality,"  would  be 
considered  interesting.  In  fact,  every  Singleton  is  an  interesting  Structure  for  just  that 
reason.  A  singleton  might  be  an  interesting  Anything  because  it  takes  only  a  few  characters 
to  type  it  out  (thereby  satisfying  a  criterion  on  Anything.Interest). 

To  locate  all  the  specializations  of  Sets,  the  rippling  would  go  in  the  opposite  direction.  For 
example,  one  of  the  entries  on  the  Specializations  facet  of  Sets  is  Set-of-structures;  one  if  Its 
Specialization  entries  is  Set-of-sets.  So  this  latter  concept  will  be  caught  in  the  net  when 
rippling  away  from  Sets  in  the  Specializations  direction. 

If  AM  wants  lots  of  examples  of  sets,  it  has  only  to  ripple  in  the  Specializations  direction, 
gathering  Examples  of  each  concept  it  encounters.  Examples  of  Sets-of-sets  (like  this  one: 
{{A},{{C,D}}})  will  be  caught  in  this  way,  as  will  examples  of  Sets-of-numbers  (like  this  one: 
{1,4,5}),  because  two  specializations  of  Sets  are  Sets-of-Sets  and  Sets-of-Numbers®. 

In  addition  to  the  three  main  reasons  for  keeping  the  set  of  facets  the  same  for  all  the 
concepts  (see  previous  page),  we  claimed  there  were  also  reasons  for  keeping  that  set  fixed 
once  and  for  all.  Why  not  dynamically  enlarge  it?  To  add  a  new  facet,  its  value  has  to  be 
filled  in  for  lots  of  concepts.  How  could  AM  develop  the  huge  body  of  heuristics  needed  to 
guide  such  filling-in  and  checking  activities?  Also,  the  number  of  facets  is  small  to  begin 
with  because  people  don’t  seem  to  use  more  than  a  few  tens  of  such  "properties"  in 
classifying  knowledge  about  a  concept'®.  If  the  viability  of  AM  seemed  to  depend  on  this 
ability,  I  would  have  worked  on  it.  AM  got  along  fine  without  being  able  to  enlarge  its  set 
of  facets,  so  no  time  was  ever  spent  on  that  problem.  I  leave  it  as  a  challenging,  ambitious 
"open  research  problem". 

*  by  'ripplinf*  upw*rd  from  Soto,  in  tho  GonI  diroction 

A 

"  Wo  aro  tttuminf  that  AM  hot  run  for  tomo  timo,  ond  olroidy  diocovorod  Numboro,  and  already  dofinod  Solt-of-Numbort. 

'®  Thto  data  m  lathorod  from  introopoction  by  myaolf  and  a  fow  othora,  and  ahouW  probably  bo  loatod  by  porformine  oomo 
paytholotical  oxporimonta. 
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5.1.3.  BElNGs  Representation  of  Knowlede 


Before  discussing  each  facet  in  detail,  let’s  interject  a  brief  historic  digression,  to  explain  the 
origins  of  this  modular  representation  scheme. 


The  ideas  arose  in  an  automatic  programming  context,  while  working  out  a  solution  to  the 
problem  of  constructing  a  computer  system  capable  of  synthesizing  a  simple  concept- 
discrimination  program  (similar  to  [Winston  70]).  The  scenario  envisioned  was  one  of 
mutual  conperatioii  among  a  group  of  a  hundred  or  so  experts,  each  a  specialist  in  some 
minute  detail  of  coding,  concept  formation,  debugging,  communicating,  etc.  Each  expert  was 
modelled  by  one  module,  one  BEING.  Each  BEING  had  the  same  number  of  slots  (parts, 
facets),  and  each  slot  was  interpreted  as  a  question  which  that  BEING  could  answer.  The 
community  of  experts  carried  on  a  round-table  discussion  of  a  programming  task  which  was 
specified  by  a  human  user.  Eventually,  by  cooperating  and  answering  each  other’s 
questions,  they  hammered  out  the  program  he  desired.  See  [Lenat  75b]  for  details. 


The  final  system,  called  PUP6,  did  actually  synthesize  several  large  LISP  programs, 
including  many  variants  of  the  concept-learning  program.  This  is  described  fully  in  [Lenat 
75a].  Unfortunately,  PUP6  had  virtually  no  natural  language  ability  and  was  therefore 
unusable  by  an  untrained  human.  Its  modal  output  was  "Eh?”. 

The  search  for  a  new  problem  domain  where  this  communication  difficulty  wouldn’t  be  so 
severe  led  to  consideration  of  elementary  mathematics. 

The  other  main  defect  of  PUP6  was  its  narrowness,  the  small  range  of  ‘target’  programs 
which  could  be  synthesized.  PUPS  had  been  designed  with  just  one  target  in  mind,  and 
almost  all  it  could  do  was  to  hit  that  target.  The  second  constraint  on  thr  new  task  domain 
was  then  one  of  having  a  non-specific  target,  a  very  broad  or  diffuse  goal.  This  pointed  to 
an  automated  researcher,  rather  than  a  problem-solver. 

These  two  constraints  then  were  (i)  elementary  math,  because  of  communication  ease,  and 
(ii)  self-guided  exploration,  because  of  the  danger  of  too  specific  a  goal.  Together,  they 
directed  the  author  to  an  investigation  which  ultimately  resulted  in  the  AM  project. 


5.2.  Facets 

How  is  each  concept  represented?  Without  c!::iming  that  this  is  the  "best"  or  preferred 
scheme,  this  section  will  treat  In  detail  AM’s  representation  of  this  knowledge. 

We  have  seen  that  the  representation  of  a  concept  can  loosely  be  described  as  a  collection  of 
facet/value  pairs,  where  the  facets  are  drawn  from  a  fixed  set  of  about  25  total  possible 
facets. 

The  facets  break  down  into  three  categories; 

!.  Facets  which  relate  this  concept  C  to  some  other  one(s):  Generalizations, 
Specializations,  Examples,  Isa’s,  In-domain-of,  In-range-of,  Views,  Intu’s,  Analogies, 
Conjee’s 

2.  Facets  which  contain  information  intensive  to  this  concept  C:  Definitions, 
Algorithms,  Domain/Range,  Worth,  Interest 
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3.  Sub-facets,  containing  heuristics,  which  can  be  tacked  onto  facets  from  either  group 
above.  These  include:  Suggest,  Fillin,  Check 

Some  facets  comt*  in  several  flavors  (e.g.,  there  are  really  four  separate  facets  -  not  just  one 
-  which  point  to  Examples:  boundary,  typical,  just-barely-failing,  foibles). 

This  section  will  cover  each  facet  in  turn.  Let’s  begin  by  listing  each  of  them.  For  a  chan  e 
of  pace,  we’ll  show  a  typical  question  that  each  one  might  answer  about  concept  C:' ' 

Name:  What  shall  we  call  C  when  communicating  with  the  user? 

Generalizations:  Which  other  concepts  have  less  restrictive  definitions  than  C? 
Specializations:  Which  concepts  satisfy  C’s  definition  plus  some  additional  constraints? 
Examples:  What  are  some  things  that  satisfy  C’s  definition? 

Isa’s:  Which  concepts’  definitions  does  C  itself  satisfy?'^ 

In-domain-of:  Which  operations  can  be  performed  on  C’s? 

In-range-of:  Which  operations  result  in  values  which  are  C’s? 

Views:  How  can  we  view  some  other  kind  of  entity  as  if  it  were  a  C? 

Intu’s:  What  is  an  abstract,  analogic  representation  for  C? 

Analogies:  Are  there  similar  (though  formally  unrelated)  concepts? 

Conjee’s:  What  are  some  potential  theorems  involving  C? 

Definitions:  How  can  we  tell  if  x  is  an  example  of  C? 

Algorithms:  How  can  we  execute  the  operation  C  on  a  given  argument? 

Domain/Range:  What  kinds  of  arguments  can  operation  C  be  executed  on?  What 
kinds  of  values  will  it  return? 

Worth:  How  valuable  is  C?  (overall,  aesthetic,  utility,  etc.) 

Interestingness:  What  special  features  make  a  C  especially  interesting? 

In  addition,  each  facet  F  of  concept  C  can  possess  a  few  little  subfacets  which  contain 
heuristics  for  dealing  with  that  facet  of  C’s: 

F.Fillin:  How  can  entries  on  C.F  be  filled  in?  These  heuristics  get  called  on  when  the 
current  task  is  "Fillin  facet  F  of  concept  X",  where  X  is  a  C. 

F.Check:  How  can  potential  entries  on  C.F  be  checked  and  patched  up? 

F.Suggest:  If  AM  gets  bogged  down,  what  are  some  new  tasks  (related  to  C.F)  it  might 
consider? 

We’ll  now  begin  delving  into  the  syntax  and  semantics  of  each  facet,  one  by  one.  Future 
chapters  will  not  depend  on  this  material.  The  reader  may  wish  to  skip  to  Section  5.3  (page 
105). 


.1.  Generalizations/Soecializations 


In  tht*  dMcuttion,  ’C"  raprtitntt  <h«  ntm«  of  th«  conetpt  who**  fK«t  i*  btinc  di*cu««*(i,  and  may  b*  raad  ”ih*  (ivan 
concapt”. 

Notk*  that  C  will  lharafor*  b*  an  axampi*  of  atch  mambar  of  laa’atC). 


Chiptvr  5 


AM:  Oiscoviry  in  Mathamttict  at  Hturittic  Search 


-68- 


Generalization  makes  possible  conscious,  controlled,  and  accurate  accomodation  of 
one's  existing  schemas,  not  only  in  response  to  the  demands  for  assimilation  of 
new  situations  as  they  are  encountered,  but  ahead  of  these  demands,  seeking  or 
creating  new  examples  to  fit  the  enlarged  concept. 

—  Skt^p 


We  say  concept  A  "is  a  generalization  of  concept  B  iff  every  example  of  B  is  an  example  of 
A.  Equivalently,  this  is  true  iff  the  definition  of  B  can  be  phrased  as  "X  (x)  tA.Defn(x)  and 
P(x)]'';  that  is,  for  x  to  satisfy  B’s  definition,  it  must  satisfy  A’s  definition  plus  some 
additional  predicate  P.  The  Generalizations  facet  of  concept  C  will  be  abbreviated  as 
C.Genl. 

C.Genl  does  not  contain  all  generalizations  of  C;  rather,  just  the  "immediate"  ones.  More 
formally,  if  A  is  a  generalization  of  B,  and  B  of  C,  then  C.Genl  will  not  contain  a  pointer  to 
A.  Instead,  C  will  point  to  B'^. 

Here  are  the  recursive  equations  which  permit  a  search  process  to  quickly  find  all 
generalizations  or  specializations  of  a  given  concept  X: 

Gen«ralizations(X)  *  Genl*(X}  *  {X}  U  Gan«ralizations(X.Genl) 

SpecializationsIX)  «  Spec*(X)  *  {X}  U  Spacializaiions(X.Spac) 

For  the  reader’s  convenience,  here  are  the  similar  equations,  presented  elsewhere  in  the  text, 
for  finding  all  examples  of  -  and  Isa’s  of  -  X: 

Examplas(X)  ■  Spac*(Exs(Spae*(X))) 

Isa'sfX)  >  G«nl*(lsa(Ganl*(X))} 

The  format  of  the  Generalizations  facet  is  quite  simple:  it  is  a  list  of  concept  names.  The 
Generalizations  facet  for  Odd-primes  might  be: 

(Odd-numbers  Primes) 


13 


C*n«r*l,  C.G«nl  will  contain  tn  entry  Xl;  Xl.GenI  will  contain  an  entry  X2; Xn.Gtnl  will  contain  B  at  one  entry;  BGenI 
will  contain  Ylu.;  YnGonl  will  contain  A. 
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Here  is  a  small  diagram  representing  generalization  relationships.  The  only  lines  drawn 
represent  the  pointers  found  in  the  Gent  facets  of  these  concepts: 

Object 


Odd-number 8  Primes 


flersenne-pr  imes 


Each  of  those  lines  represents  an  arrow  which  slants  upwards,  Indicating  a  GenI  link.  For 
example,  we  see  that  the  Generalizations  facet  of  Odd-primes  contains  pointers  to  both 
Odd-numbers  and  to  Primes.  There  is  no  pointer  from  Odd-primes  upward  to  Number, 
because  there  is  an  "intermediate"  concept  (namely,  Primes).  There  is  no  pointer  from 
Mersenne-primes  to  Object,  since  a  chain  of  intermediate  concepts  links  them. 

The  reason  for  these  strange  constraints  is  so  that  the  total  number  of  links  can  be 
minimized.  There  is  no  harm  if  a  few  redundant  ones  sneak  in.  In  fact,  frequently-used 
paths  are  granted  the  status  of  single  links,  as  we  shall  soon  see. 

We’ve  been  talking  about  both  Specializations  and  Generalizations  as  If  they  were  very 
similar  to  each  other.  It’s  time  to  make  that  more  explicit: 
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Specializations  are  the  converse  of  Generalizations.  The  format  is  the  same,  and  (hopefully) 
A  is  an  entry  on  B’s  Specializations  facet  iff  B  is  an  entry  on  A’s  Generalizations  facet. 

The  uses  of  these  two  facets  are  many: 

1.  AM  can  sometimes  establish  independently  that  A  is  both  a  generalization  and  a 

specialization  of  B;  in  that  case,  AM  would  like  to  recognize  that  fact  easily,  so  it 
can  conjecture  that  A  and  B  specify  equivalent  concepts.  Such  coincidences  are 
easily  detected  as  cycles  in  the  Genl  (or  Spec)  graph.  In  these  cases,  AM  may 
physically  merge  A  and  B  (and  all  the  other  concepts  in  the  cycle)  into  one  concept. 

2.  Sometimes,  AM  wants  to  assemble  a  list  of  all  specializations  (or  generalizations)  of 

X,  so  that  it  can  test  whether  some  statement  which  is  just  barely  true  (or  false)  for 
X  will  hold  for  any  of  those  specializations  of  X. 

3.  Sometimes,  the  list  of  generalizations  is  used  to  assemble  a  list  of  isa’s;  the  list  of 

specializations  helps  assemble  a  list  of  examples.*'* 

4.  A  common  and  crucial  use  of  the  list  of  generalizations  is  to  locate  all  the  heuristic 

rules  which  are  relevant  to  a  given  concept.  Typically,  the  relevant  rules  are  those 
tacked  onto  Isa’s  of  that  concept,  and  the  list  of  Isa’s  is  built  up  from  the  list  of 
generalizations  of  that  concept.  This  was  also  mentioned  on  page  56. 

5.  To  incorporate  new  knowledge.  If  AM  learns,  conjectures,  etc.  that  A  is  a 
specialization  of  B,  then  all  the  machinery  (all  the  theorems,  algorithms,  etc.)  for  B 
become  available  for  working  with  A. 

Here  is  a  little  trick  that  deserves  a  couple  paragraphs  of  its  own.  AM  stores  the  answers  to 
common  questions  (like  "What  are  all  the  specializations  of  Operation")  explicitly,  by 
Intentionally  permitting  redundant  links  to  be  maintained.  If  two  requests  arrive  closely  in 
time,  to  test  whether  A  is  a  generalization  of  B,  then  the  result  is  stored  by  adding  "A"  as 
an  entry  on  the  Generalizations  facet  of  B,  and  adding  "B"  as  a  new  entry  on  the 
Specializations  facet  of  A.  The  slight  extra  space  is  more  than  recompensed  in  epu  time 
saved. 

If  the  result  were  False  (A  turned  out  not  to  be  a  generalization  of  B)  then  the  links  would 
specify  that  finding  explicitly,  so  that  the  next  request  would  not  generate  a  long  search 
again.  Such  failures  are  recorded  on  two  additional  facets:  Genl-not  and  Spec-not.  Since 
most  concept  pairs  A/B  are  related  by  Spec-not  and  by  Genl-not,  the  only  entries  which  get 
recorded  here  are  the  ones  which  were  frequently  called  for  by  AM.  If  space  ever  gets  tight, 
all  such  facets  can  be  wiped  clean  with  no  permanent  damage  done. 

These  two  "shadow"  facets  (Genl-not  and  Spec-not)  are  not  useful  or  Interesting  in  their  own 
right.  If  AM  ever  wished  to  know  all  the  concepts  which  are  not  generalizations  of  C,  the 
fastest  way  would  be  to  take  the  set-difference  of  all  concepts  and  Generalizations(C).  Since 
they  are  quite  incomplete,  Genl-not  and  Spec-not  are  used  more  like  a  cache  memory:  they 
save  time  whenever  they  are  applicable,  and  don’t  really  cost  much  when  they  aren't 
applicable.  Because  of  their  superfluity,  these  two  facets  will  not  be  mentioned  again.  I  only 
mentioned  them  above  because  they  do  greatly  speed  up  AM’s  execution  time,  and  because 
they  may  have  some  psychological  analog. 
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5.2.2.  Examples/lsa’s 


Usually,  to  show  that  a  definition  implies  no  contradiction,  we  proceed  hn  example. 
we  try  to  make  an  example  of  a  thing  satisfying  the  definition.  We  wish  to  define 
a  notion  A,  and  we  say  that,  by  definition,  an  A  is  anything  for  which  certain 
postulates  are  true.  If  we  can  demonstrate  directly  that  all  these  postulates  are 
true  of  0  certcdn  object  B,  the  definition  will  be  justified;  the  olject  B  will  be  an 
example  of  an  A. 


—  Poinear«’ 


Following  Poincare’,  we  say  "concept  A  is  an  example  of  concept  B”  iff  A  satisfies  B’s 
definition.'®  Equivalently,  we  say  that  "A  isa  B".  It  would  be  legal  (in  that  situation)  for  "A" 
to  be  an  entry  on  B.Exs  (the  Examples  facet  of  concept  B)  and  for  "B"  to  be  an  entry  on 
A.Isa  (the  Isa’s  facet  of  concept  A).  Some  earlier  mention  of  the  Examples  and  Isa’s  facets 
can  be  seen  in  Chapter  4,  page  57. 

The  Examples  facet  of  C  does  not  contain  all  examples  of  C;  rather,  just  the  ’’immediate" 
ones.  The  examples  facet  of  Numbers  will  not  contain  "H"  since  it  is  contained  In  the 
examples  facet  of  Odd-primes.  A  "rippling"  procedure  is  used  to  acquire  a  list  of  all 
examples  of  a  given  concept.  The  basic  equation  is: 

Examplts(x)  ■  $pecislizaiion8(Exs(Sp«cializations(x))) 

where  Exs(x)  is  the  contents  of  the  examples  facet  of  x.  Examples(x)  represents  the  final  list 
of  all  known  items  which  satisfy  th?  definition  of  X.  Examples(x)  thus  must  include  Exs(x). 

Specializations(x)  might  be  more  regularly  written  Spec’'‘(x).  That  Is,  all  members  of  x.Spec, 
all  members  of  their  Spec  facet,  etc.  Note  the  similarity  of  this  to  the  formula  for  Isa*s(x), 
given  on  page  57.  We  could  also  write  the  above  equation  as  follows: 

Examplti(x)  *  Spac*(Ext(Sp«e*(x))) 

As  an  illustration,  we  shall  show  how  AM  would  recognize  that  "3"  is  an  example  of 
Object: 


Wh«t  do**  thi*  tn**n’  B.Dtfn  i*  *  Ll*p  pr*dlc*t*,  *  Limbd*  txprtttion.  If  it  i*  fod  A  at  it*  argumant,  and  it  r*turn*  Tru*, 
w*  tay  that  A  i»  a  B,  or  that  A  «ati*fi*«  B’«  definition  if  B.Dtfn  raturna  NIL,  w*  tay  that  A  i«  not  a  B,  or 
that  A  fail*  B’t  dafinition.  If  B.Oafn  runt  out  of  tima  bafor*  raturning  a  T/NIL  valua,  thar*  it  no  dafinit* 
ttatamant  of  thi*  form  wt  can  maka  In  that  cata,  AM  might  chack  to  taa  whathar  A  tatitfi**  th*  dafinition 
of  tom*  tpacialization  of  6,  or  whathar  A  fail*  th*  dafinition  of  tom*  g*naraliz*tion.of  B. 


-"w-TV  %"■  VI  '•  ‘  %  •’ 
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Ob  ject 


flersenne-pr ’lines 


3 


As  the  graph  above  shows,  AM  would  ripple  in  the  Spec  direction  i  times,  moving  from 
Object  all  the  way  to  Mersenne-primes;  then  descend  once  in  the  Exs  direction,  to  reach  "3"; 
then  ripple  0  more  times  in  the  Spec  direction.  Thus  "3"  is  seen  to  be  an  example  of 
Object,  according  to  the  above  formula.  Similarly,  we  see  that  "3"  is  also  an  example  of 
Number,  of  Primes,  of  Odd-number,  of  Odd-primes,  and  of  course  an  example  of 
Mersenne-primes. 

As  with  Generalizations/Specializations,  the  reasons  behind  the  incomplete  pointer  structure 
is  simply  to  save  space,  and  to  minimize  the  difficulty  of  updating  the  graph  structure 
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whenever  new  links  are  found.  Suppose  a  new  Mersenne  prime'®  is  computed.  Wouldn’t  it 
be  nice  simply  to  add  a  single  entry  to  the  Exs  facet  of  Mersenne-primes,  rather  than  to 
have  to  update  the  Exs  pointers  from  a  dozen  concepts? 

There  is  no  harm  if  a  few  redundant  links  sneak  in.  In  fact,  frequently-used  paths  are 
granted  the  status  of  single  links.  If  two  requests  arrive  closely  in  time,  to  test  whether  A 
isa  B,  then  the  result  is  stored  as  an  entry  on  the  Isa  facet  of  A,  and  the  Exs  facet  of  B.  If 
the  result  were  False,  then  the  links  would  specify  that,  so  that  the  next  request  would  not 
generate  a  long  search.  In  fact,  there  is  a  separate  facet  called  Exs-not,  and  one  called  Isa- 
not.  These  two  shadowy  facets  are  quite  analogous  to  the  unmentionable  facets  "Genl-not" 
and  "Spec-not",  discussed  In  the  previous  subsection. 

"Isa’s"  is  the  converse  of  "Examples".  The  format  is  the  same,  and  (if  A  and  B  are  both 
concepts)  A  Is  an  entry  on  B.lsa  iff  B  is  an  entry  on  A.Exs.  In  other  words,  A  is  a  member 
of  Examples(B)  Iff  B  Is  a  member  of  !sa’s{A).  Due  to  an  ugly  lack  of  standardization,  non¬ 
concepts  are  allowed  to  exist.  Thus,  "3"  Is  an  example  of  Primes,  but  is  not  itself  a  concept. 
Examples  of  X  sometimes  are  concepts,  of  course:  "Intersectointersect"  is  an  example  of 
Compose-with-self.  And  Isa’s(x)  are  always  concepts.  The  highest  level  concept  is  called 
"Anything".  Its  definition  Is  the  atom  T.  That  is,  "X(x)  T".  This  high-level  concept  can  claim 
everything  as  its  examples. 

The  uses  of  the  Exs/Isa’s  facets  are  similar  to  those  for  Genl/Spec  (see  previous  subsection). 

Their  formats  are  quite  a  bit  more  complicated  than  the  Genl/Spec  facets’  formats,  when  we 
finally  get  to  the  implementation  level,  however.  There  are  really  a  cluster  of  different  facets 
all  related  to  Examples; 

1.  TYPICAL:  This  is  a  list  of  average  examples.  Care  must  be  taken  to  include  a  wide 
spectrum  of  allowable  kinds  of  examples.  For  "Sets",  these  would  include  sets  of 
varying  size,  nesting,  complexity,  type  of  elements,  etc. 

■  2.  BOUNDARY:  Items  which  just  barely  pass  the  definition  of  this  concept.  This 
might  include  items  which  satisfy  the  base  step  of  a  recursive  definition,  or  items 
which  were  intuitively  believed  to  be  non-examples  of  the  concept.  For  "Sets",  this 
might  include  the  empty  set. 

3.  BOUNDARY-NOT:  Items  which  just  barely  fail  the  definition.  This  might  include 

an  item  which  had  to  be  slightly  modified  during  checking,  like  {A,B,A}  becoming 
{A,B}. 

4.  FOIBLES;  Total  failures.  Items  which  are  completely  against  the  grain  of  this 
concept.  For  "Sets",  this  might  include  the  operation  "Compose". 

5.  NOT:  This  is  the  "cache"  trick  used  to  store  the  answers  to  frequently-asked 
questions.  If  AM  frequently  wants  to  know  whether  X  is  an  example  of  Y,  and  the 
answer  is  No,  then  much  time  can  be  saved  by  adding  X  as  an  entry  to  the  Exs-not 
facet  of  Y. 

An  individual  item  on  these  facets  may  just  be  a  concept  name,  or  it  may  be  more 
complicated.  In  the  case  of  an  operation,  it  is  an  item  of  the  form  <a|a2...-+v>;  i.e.,  actual 
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arguments  and  the  value  returned.  In  the  case  of  objects,  it  is  an  object  of  that  form.  An 
Exs  facet  of  the  concept  Sets  might  contain  {a}  as  one  entry. 

Here  is  a  more  detailed  illustration.  Consider  the  Examples  facet  of  Set-union.  It  might 
appear  thus; 

TYPICAL:  {A}U{A,B}-*{A,B}; 

{A,B}U{A,BH{A,B}; 

{A,<3,4,3>,{A,B}}U{3,A}-+{A,<3,4,3>,{A,B},3}. 

BOUNDARY:  {}UX“»X 

BOUNDARY-NOT:  {A,B}U{A,C}“»{A,B,A,C}; 

{A,B,C,D)U{E.F,G,H,I.JH{A,B,C,E,F,G,H,I,J} 

FOIBLES:  <2,A,2> 

NOT:  no  •ntrios 

The  format  for  Isa’s  are  much  simpler;  there  are  only  two  kinds  of  links,  and  they’re  each 
merely  a  list  of  concept  names.  Here  is  the  Isa  facet  of  Set-union; 

ISA:  (Oporaiion'^  Domain«Rang«‘Op) 

ISA-NOT:  (Structura  Composition  Pradicata) 

At  some  time,  some  rule  asked  whether  Set-union  isa  Composition.  As  a  result,  the  negative 
response  was  recorded  by  adding  "Composition"  to  the  Isa-not  facet  of  Set-union,  and 
adding  "Set-union"  to  the  Exs-not  subfacet  of  the  Examples  facet  of  the  concept 
Composition  (indicating  that  Set-union  was  definitely  not  an  example  of  Composition,  yet 
there  was  no  reason  to  consider  it  a  foible). 


We  shall  say  that  A  is  in  the  domain  of  B  (written  "A  In-dom-of  B")  iff 

1.  A  and  B  are  concepts 

2.  B  isa  Operation 

3.  A  is  equal  to  (or  at  least  a  specialization  of)  one  of  the  domain  components  of  the 

operation  B.  That  is,  B  can  be  executed  using  any  example  of  A  as  one  of  its 
arguments.'® 

For  example.  Odd-perfect-squares  is  In-dom-of  Add,  since  Odd-perfect-squares  is  a 

specialization  of  Numbers,  and  Numbers  is  one  component  of  the  following  entry  which  is 

'  ^  Actutlly>  AM  It  not  quit*  tmirt  tnough  to  utt  th«  viritbit  X  «t  thown  in  th«  boundary  txampltt.  It  would  timply  ator*  a 
ftw  inatancaa  of  thit  (antral  rula,  pkit  hava  an  antry  of  tha  form  <Equivalant.  Idantity(X)  and  Sat- 
union(X,{})>  on  tha  Ext  facat  of  Conjacturaa.  Notka  that  bacauaa  of  tha  aaymmatric  way  Sat-union  waa 
dafinad,  only  ona  bptidad  boundary  axampla  waa  found.  If  anothar  dafinition  wara  tuppliad,  tha  convarta 
kind  of  boundary  axamplaa  would  ba  found. 

1 8 

Thia  antry  it  radundant. 

'®  Mora  formally,  wa  can  aay  that  thia  occura  whanavar  tema  antry  on  tha  Domain/ran(a  facat  of  B  hat  tha  form  <D|  . 

D,  -»  R>  with  aoma  Dj  a  mambar  of  GanaralaationaCA).  Than  A  it  a  apacialixation  of  toma  domain  componant 
of  aoma  antry  on  B  Oomain/ranfa. 
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located  on  Add.Domain/range;  <Numbcrs  Numbers  -»  Numbers>.  Since  Odd-perfect-squares 
is  a  specialization  of  Numbers,  the  operation  ‘Add’  can  be  executed  using  any  example  of 
Odd-perfect-squares  as  its  argument. 

As  another  example,  Odd-perfect-squates  is  also  In-dom-of  Set-insert,  one  of  whose 
Domain/range  entries  is  <Anything  Sets  -♦  Sets>.  This  is  because  Odd-perfect-squares  is  a 
specialization  of  Anything.  So  Set-insert  is  executed  on  two  arguments,  and  the  hrst 
argument  can  be  any  example  of  Odd-perfect-squares  (the  second  argument  must  be  an 
example  of  Sets).^° 

Although  it  can  be  recomputed  very  easily,  we  may  wish  to  record  the  fact  that  A  In-dom-of 
B  by  adding  the  entry  "B"  to  the  In-dom-of  facet  of  A.  AM  may  even  wish  to  add  this  new 

entry  to  the  Domain/range  facet  of  B  (where  A  is  a  specialization  of  the  domain 
component  of  B): 

<  D|  D2...  Dj.j  A  Dj+j...  Dj  -♦  R>.  The  two  examples  given  above  would  produce  new 

domain/range  entries  of  <Odd-perfect-squares  Numbers  -♦  Numbers>  for  Add,  and  <Odd- 
perfect-squares  Sets  -*  Sets>  for  Set-insert. 

The  semantic  content  of  "In-dom-of"  is:  what  can  be  done  to  any  example  of  a  given 
concept  C?  Given  an  example  of  concept  C,  what  operations  can  be  run  on  that  thing? 
Here  are  some  illustrations: 

"Odd-perfect-squares  In-dom-of  Set-insert"  tells  us  that  Set-insert  can  be  run  on  any 
particular  Odd-perfect-square  we  can  grab  hold  of. 

"Operation  In-dom-of  Compose"  tells  us  that  Compose  can  be  run  on  any  operation  we 
want. 

"Dom-Range-operation  In-dom-of  Compose"  tells  us  that  Compose  can  be  run  on  any 
operation  which  has  its  range  equal  to  one  of  its  domain  components. 

"Primes  In-dom-of  Squaring"  tells  us  that  we  can  apply  the  operation  Squaring  to  any 
particular  prime  number  we  wish. 

Let  us  now  turn  from  In-dom-of  to  the  related  facet  In-ran-of. 

We  say  that  concept  A  is  in  the  range  of  B  iff  B  is  an  Activity^'  and  A  is  a  specialization 
of  the  range  of  B.  More  precisely,  we  can  say  that  "A  In-ran-of  B"  iff 

1 .  A  and  B  are  concepts 

2.  B  isa  Operation  (i.e.,  B  is  an  example  of  the  concept  "Operation") 

3.  Some  entry  on  the  Domain/range  facet  of  B  has  the  form  <D|  D2...  Dj  -+  R>  with  R 

a  generalization  of  A. 

For  example.  Odd-perfect-squares  is  In-ran-of  Squaring,  since  (1)  both  of  those  are  concepts, 
(2)  Squaring  is  an  operation,  (3)  one  of  its  Domain/range  entries  is  <Numbers-»Perf- 

^  Sine*  Odd-p«rf«ct-(qu«rt*  i*  mor*  clOMly  rdittd  to  Nutnbtrt  than  w  (ho  concopi  Anythinf  (half  aa  many  GonI  hnka 
away),  AM  oxpacta  that  raatrictinf  Add  to  Odd-porfact-aquaroa  will  probably  yiald  a  mora  promiainc  now 
oporation  than  raatrictinf  Sat-inaart  to  only  inaart  odd  parfact  aquaraa  into  aata. 

21 

la.,  iff  9  ias  Activa,  iff  8cExiirip!as(Activa),  iff  Activa.{)afn(8)>Trua.  Actually,  ainca  tha  ranga  of  Pradicataa  ia  maraly 
{T,F},  wa  may  aa  wall  aaautna  that  B  ia  an  eparation,  not  a  pradkata.  Thia  ia  in  fact  aaaumad,  in  tha  taxt  and 
in  tha  actual  AM  ayatam. 
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squarej>,  and  Perf-squares  is  a  generalization  of  Odd-perfect-squares^^. 

Here  is  what  the  In-ran-of  facet  of  Odd-perfect-squares  might  look  like: 

(Squaring  Add  TIMES  Maximum  Minimum  Cubing) 

Each  of  these  operations  will  -  at  least  sometimes  -  produce  an  odd  perfect  square  as  its 
result. 

Semantically,  the  In-ran-of  relation  between  A  and  B  means  that  one  might  be  able  to 
produce  examples  of  A  by  running  operation  B.  Aha!  This  is  a  potential  mechanism  for 
finding  examples  of  a  concept  A.  All  you  need  do  is  get  hold  of  In-ran-of(A),  and  run  each 
of  those  operations.  Even  more  expeditious  is  to  examine  the  Examples  facets  of  each  of 
those  operations,  for  already-run  examples  whose  values  should  be  tested  using  A.Defn,  to 
see  if  they  are  examples  of  A’s.  AM  relies  on  this  in  times  of  high  motivation;  it  is  too 
"blind"  a  method  to  use  heavily  all  the  time. 

This  facet  is  also  useful  for  generating  situations  to  investigate.  Suppose  that  the 
Domain/rapge  facet  of  Doubling  contains  only  one  entry:  <  Numbers  -»  Numbers  >.  Then 
syntactically.  Odd-numbers  is  in  the  range  of  Doubling.  Eventually  a  heuristic  rule  may 
have  AM  spend  some  time  looking  for  an  example  of  Doubling,  where  the  result  was  an 
odd  number.  If  none  is  quickly  found,  AM  conjeefres  that  it  never  will  be  found.  Since 
one  definition  of  Odd-"jmber(x)  is  "Number(x)  and  Not(£ven-number(x))",  the  only  non¬ 
odd  numbers  are  even  numbers.  So  AM  will  increment  the  Domain/range  facet  of 
Doubling  with  the  entry  <Numbers->Even-numbers>,  and  remove  the  old  entry.  Thus  Odd- 
numbers  will  no  longer  be  In-dom-of  Doubling.  AM  can  of  course  chance  upon  this 
conjecture  in  a  more  positive  way,  by  noticing  that  all  known  examples  of  Doubling  have 
results  which  are  examples  of  Even-numbers.^^. 

A  mere  productive  result  is  suggested  by  examining  the  cases  where  Odd-perfect-squares 
are  the  result  of  cubing.  The  smallest  such  odd  numbers  are  1,  729,  and  156?5.  In  general, 

these  numbers  are  all  those  of  the  form  (2n+l)®.  How  could  AM  notice  such  an  awkward 
relationship? 


The  general  question  to  ask,  when  A  In-ran-of  B,  is  "What  is  the  set  of  domain  items  whose 
values  (under  the  operation  B)  are  A's?"  In  case  the  answer  is  "AH"  or  "None",  some  special 
modifications  can  be  made  to  the  Domain/rangc  facets  and  In-dom-of,  In-ran-of  facets  of 
various  concepts,  and  a  new  conjecture  can  be  printed.  In  other  cases,  a  new  concept  might 
get  created,  representing  precisely  the  set  of  all  arguments  to  B  which  yield  values  in  A.  If 
you  will,  this  is  the  inverse  image  of  A,  under  operation  B.  In  the  case  of  B  a  predicate, 
this  might  be  the  set  of  all  arguments  which  satisfy  the  predicate. 


22 

Why?  B*c*u«*  G«n*r*lT*1ion«(0dd-p«rf*ct-«qu«r**)  i*  th«  of  conetpt*  (Odd-numtxr*  Ptrf-tquar**  Number* 
Object*  Any-concept  Anylhini},  hence  contiin*  Perf-equtre*.  So  Perf-*qu«re*  I*  *  (enerilizition  of  Odd- 
perfect-equire*. 

23 

ThI*  poeitive  tpproKh  I*  in  f«ct  the  way  AM  noticed  thi*  particuler  retationehip. 

^  Wronj  That  waf  an  exponent,  not  a  footnote' 
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In  the  case  of  B»Cubing  and  A=Odd-perfect-squares,  the  heuristic  mentioned  above  will 
have  AM  create  a  new  concept:  the  inverse  image  of  Odd-perfect-squares  under  the 
operation  of  Cubing.  That  is,  find  numbers  whose  cubes  are  Odd-perfect-squares.  It  is 
quickly  noticed  that  such  numbers  are  precisely  the  set  of  Odd-perfect-squares  themselves! 
So  The  Domain/range  facet  of  Cubing  might  get  this  new  entry:  <Odd-perfcct-squares  -♦ 
Odd-perfect-squares>.  But  not  all  squares  can  be  reached  by  cubing,  only  a  few  of  them 
can.  AM  will  notice  th’s,  and  the  new  range  would  then  be  isolated  and  might  be  rtiiamed 
by  the  user  "Perfect-s .  powers".  Note  that  all  this  was  brought  on  by  examining  the  In- 
ran-of  facet  of  Odd-p  ’ct-squares.  "Cubing"  was  just  one  of  the  seven  entries  there. 
There  are  six  more  .-fo'-ic..  to  tell  in  this  ti.iy  nook  of  AM's  activities. 

How  exactly  doe.-  •.  ^,0  about  gathering  th:  In-ran-o.'  and  In-dom-of  lists?  Given  a 

concept  C,  AM  can  scan  down  the  ;:!obal  tree  of  operations  (the  Exs  and  Spec  links  below 
the  concept  'Active').  For  if  C  -s  not  In-dom  ...f  F,  it  certainly  won't  be  In-dom-of  any 
specialization  of  F.  Similarly,  it  can't  be  produced  by  F,  it  won't  be  produced  by  any 
specialization  of  F.  If  you  can’t  g  /  using  Doubling  you'll  never  get  it  by  Quadrupling.  So 
AM  simply  ripples  around,  as  usual.  The  precise  code  for  this  algorithm  is  of  little  interest. 
There  are  not  that  many  operations,  and  it  is  cheap  to  tell  whether  X  is  a  specialization  of 
a  given  concept,  so  even  an  exhaustive  search  wouldn't  be  prohibitive.  Finally,  recall  that 
such  a  search  is  not  done  all  the  time.  It  will  be  done  initially,  pe-haps,  but  after  that  the 
In-dom-of  and  In-ran-of  networks  will  only  need  slight  updating  now  and  then. 


5.2.4.  Views 

Often,  two  concepts  A  and  B  will  be  inequivalent,  yet  there  will  be  a  "natural"  bijection 
between  one  and  (a  subset  of)  the  other.  For  example,  consider  a  finite  set  S  of  atoms,  and 

consider  the  set  of  all  its  subsets,  2^,  also  called  the  power  set  of  S.  Now  S  is  a  member  of, 

but  not  a  subset  of,  2^  (e.g.,  if  S={x,y,...},  then  x  is  not  a  member  of  2^).  On  the  other  hand, 
we  can  identify  or  view  S  as  a  subset  by  the  mapping  v-*{v}.  Then  S  is  associated  with  the 

following  subset  of  2^:  {  {x},  {y},... }.  Why  would  we  want  to  do  this?  Well,  it  shows  that  S  is 

identified  with  a  proper  subset  of  2^,  and  indicates  that  S  has  a  lower  cardinality 
(remember:  all  sets  are  finite). 


As  another  example,  most  of  us  would  agree  that  the  set  {x,  {y},  z}  ran  be  associated  with 
the  following  bag:  (x,  {y},  z).  Each  of  them  can  be  viewed  as  the  other.  Sometimes  such  a 
viewing  is  not  perfectly  natural,  or  isn't  really  a  bijection:  how  could  the  bag  (2,  2,  3)  be 
viewed  as  a  set?  Is  {2,3}  better  or  worse  than  ;?,{?},3)? 


The  View  facet  of  a  concept  C  describes  how  to  view  instances  of  another  concept  D  as  if 
they  were  C's.  For  example,  this  entry  on  the  View  facet  of  Sets  explains  how  to  view  any 
given  structure  as  if  it  were  a  Set: 


Structure:  X  (x)  Enclose-in-braces(Sort(Rflmove-multiple-«lements(x))} 


ir  OTM/On 

*«  *  wi «  vi  < 


list  < 


Jv.UtN'tUo* 


ivilC  piijgidiii  vvijUiu  iciiiuVt'  inUitipic  elements  (leaving 
<z,a,c>),  sort  the  structure  (making  it  <'a,c,z>),  and  replace  the  "<...>"  by  leaving  the 
final  value  as  {a,c,z}.  Note  that  this  transformation  is  not  I- 1;  the  list  <a,c,z>  would  get 
transformed  Into  this  same  set.  On  the  other  hand,  it  may  be  more  useful  than 
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transforming  the  original  list  into  {2,{a,{c,{a}}}}  which  retains  the  ordering  and  multiple 
element  information.  Both  of  those  transformations  may  be  present  as  entries  on  the  View 
facet  of  Sets. 

As  it  turns  out,  the  View  facet  of  Sets  actually  contains  only  the  following  information: 
Structure:  X  (x)  Enclose-in-bracestx) 

Thus  the  Viewing  will  produce  entities  which  are  not  quite  sets.  Eventually,  AM  will  get 
around  to  executing  a  task  of  the  form  "Check  Examples  of  Sets",  and  at  that  time  the  error 
will  be  corrected.  One  generalization  of  Sets  is  No-multiple-elements-Structures,  and  one  of 
its  entries  under  Examples.Check  says  to  remove  all  multiple  elements.  Similarly, 
Unordered-structures  is  a  generalization  of  Sets,  and  one  of  its  Examples.Check  subfacet 
entries  says  to  sort  the  structure.  If  either  of  these  alters  the  structure,  the  old  structure  is 
added  to  the  Boundary-not  subfacet  (the  ‘Just-barely-miss’  kind)  of  Examples  facet  of  Sets. 

The  syntax  of  the  View  facet  of  a  concept  C  is  a  list  of  entries;  each  entry  specifies  the  name 
of  a  concept,  X,  and  a  little  program  P.  If  it  is  desired  to  view  an  instance  of  X  as  if  it  were 
a  C,  then  program  P  is  run  on  that  X;  the  result  is  (hopefully)  a  C.  The  programs  P  are 
opaque  to  AM;  they  must  have  no  side  effects  and  Le  quick. 

Here  is  an  entry  on  the  View  facet  of  Singleton: 

Anything:  X  (x)  Set-ins«rt(x,  PHI) 

In  other  words,  to  view  anything  as  a  singleton  set,  just  insert  it  into  the  empty  set.  Note 
that  this  is  also  one  way  to  view  anything  as  a  set.  As  you’ve  no  doubt  guessed,  there  is  a 
genera!  formula  explaining  this: 

Vi«ws(X)  I  View(Specializations(X)) 

Thus,  to  find  all  the  ways  of  viewing  something  as  a  C,  AM  ripples  away  from  C  in  the 
Spec  direction,  gathering  all  the  View  facets  along  the  way.  All  of  their  entries  are  valid 
entries  for  C.View  as  well. 

In  addition  to  these  built-in  ways  of  using  the  Views  facets,  some  special  uses  are  made  in 
individual  heuristic  rules.  Here  is  a  heuristic  rule  which  employs  the  Viewing  facets  of 
relevant  concepts  in  order  to  find  some  examples  of  a  given  concept  C: 

IF  the  currant  tack  ic  to  Fill-in  Examplas  of  C, 
and  C  has  soma  antries  on  its  Viaw  facat, 

and  ona  of  thosa  antrias  <X,P>  indicatas  a  concapt  X  which  has  soma  Known  Examplas, 
THEN  run  tha  associated  program  P  on  aach  mambar  of  ExamplesfX), 

and  add  tha  following  task  to  tha  agenda:  "Check  Examples  of  C",  for  the  followir.g 
reason:  "Some  very  risky  techniques  were  used  to  find  examples  of  C",  and 
that  reason's  rating  is  computed  as:  A'  »rage(Worth(X),  f|tha  examples  of  C 
found  in  this  mannar||). 

Ssy  the  '  sk  selected  from  the  agenda  was  "Fill-in  Examples  of  Sets".  We  saw  that  one 
entry  on  -ets.View  was  Structure:  X(x)  Encloca-in-bracas(x).  Thus  it  is  of  the  form  <X:P>, 
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with  X-Structure.  The  above  heuristic  rule  will  trigger  if  any  examples  of  Structures  are 
known.  The  rule  will  then  use  the  View  facet  of  Sets  to  find  some  examples  of  Sets.  So  AM 
will  go  off,  gathering  all  the  examples  of  structures.  Since  Lists  is  a  Specialization  of 
Structure,  the  computation  of  Examplcs(Structures)  will  eventually  ripple  downwards  and 
ask  for  Examples  of  Lists.  If  the  Examples  facet  of  Lists  contains  the  entry  <z,a,c,a,a>,  then 
this  will  be  retrieved  as  one  of  the  members  of  Examples(Structure).  The  heuristic  rule  takes 
each  such  member  in  turn,  and  feeds  it  to  Set.View’s  little  program  P.  In  this  case,  the 
program  replaces  the  list  brackets  with  set  braces,  thus  converting  <z,a,c,a,a>  to  {z,a,c,a,a}. 

In  this  manner,  all  the  existing  structures  will  be  converted  into  sets,  to  provide  examples  of 
sets.  After  all  such  conversions  take  place,  a  great  number  of  potential  examples  of  Sets  will 
exist.  The  final  action  of  the  right  side  of  the  above  heuristic  rule  is  to  add  the  new  task 
"Check  examples  of  Sets"  to  the  agenda.  When  this  gets  selected,  all  the  "slightly  wrong" 
examples  will  be  fixed  up.  For  example,  {z,a,c,a,a}  will  be  converted  to  {a,c,z}. 

If  any  reliance  is  made  on  those  unchecked  examples,  there  is  the  danger  of  incorrectly 
rejecting  a  valid  conjecture.  This  is  not  too  serious,  since  the  very  first  such  reliance  will 
boost  the  priority  of  the  task  "Chock  oxamplos  of  Sots",  and  it  would  then  probably  hs  the 
very  next  task  chosen. 


5.2.5.  Intuitions 


The  mathematician  does  not  work  like  a  machine;  we  cannot  overemphasize  the 
fundamental  role  played  in  his  research  by  a  special  intuition  (frequently  wroug), 
which  is  not  common-sense,  but  rather  a  divination  of  the  regular  behavior  he 
expects  of  mathematical  beings. 


mm  Bourbakl 


This  facet  turned  out  to  be  a  "dud",  and  was  later  excised  from  all  the  concepts.  It  will  be 
described  below  anyway,  for  the  benefit  of  future  researchers.  Feel  free  to  skip  directly  to 
the  next  subsection. 

The  initial  idea  was  to  have  a  set  of  a  few  (3-10)  large,  global,  opaque  LISP  functions.  Each 
of  these  functions  would  be  termed  an  "Intuition'  and  would  have  some  suggestive  name 
like  "jigsaw-puzzle",  "see-saw",  "archery",  etc.  Each  function  would  somehow  mcr’- 1  the 
particular  activity  implied  by  its  name.  There  would  be  a  multitude  of  parameter  -,’hich 
could  be  specified  by  the  "caller"  as  if  they  were  the  arguments  of  the  function.  The 
function  would  then  work  to  fill  in  values  for  any  unspecified  parameters.  That’s  all  the 
function  does.  The  caller  would  also  have  to  specify  which  parameters  were  to  be 
considered  as  the  "results"  of  the  function. 

For  the  see-saw,  the  caller  might  provide  the  weight  of  the  left-hand-side  sitter,  and  the  final 
position  of  the  see-saw,  and  ask  for  the  weight  of  the  right-hand  sitter.  The  function  would 
then  compute  that  weight  (as  any  random  number  greatcr/less-than  the  left-hand  weight, 


Chiptar  5 


AM:  Ditcoviry  in  Mtlhtnulic*  it  Hturiitic  Starch 


-80- 


depending  on  the  desired  tilt  of  the  board).  Or,  the  caller  might  specify  the  two  weights 
and  ask  for  the  final  position. 

The  See-saw  function  is  an  expert  on  this  subject;  it  has  efficient  code  for  computing  any 
values  which  can  be  computed,  and  for  randomly  instantiating  any  variables  which  may 
take  on  any  value  (e.g.,  the  first  names  of  the  people  doing  the  sitting).  When  an  individual 
call  is  made  on  this  function,  the  caller  is  not  told  how  the  final  values  of  the  variables  were 
computed,  only  what  those  values  end  up  as. 

So  the  Intuitions  were  to  be  experimental  laboratories  for  AM,  wherein  it  could  get  some 
(simulated)  real-world  empirical  data.  If  the  seesaw  were  the  Intuition  for  ">",  and  weight 
corresponded  to  Numbers,  then  several  relationships  might  be  visualized  intuitively  (like  the 
anti-symmetry  of  ">")•  This  is  a  nice  idea,  but  in  practice  the  only  relationships  derived  in 
this  way  were  the  ones  that  were  thought  up  while  trying  to  encode  the  Intuition  functions. 
This  shameful  behavior  led  to  the  excision  of  the  Intuitions  facets  completely  from  the 
system. 

As  another  example,  suppose  AM  is  considering  composing  two  relations  R  and  S.  If  they 
have  no  common  Intuition  reference,  then  perhaps  they’re  not  meaningfully  composable.  If 
they  do  both  tie  into  the  same  Intuition  function,  then  perhaps  that  function  can  tell  us 
something  about  the  composition.  This  is  a  nice  idea,  but  in  practice  very  few  prunings 
were  accomplished  this  way,  and  no  unanticipated  combinations  were  fused. 

Each  Intuition  entry  is  like  a  "way  in"  to  one  of  the  few  global  scenarios.  It  can  be 
characterized  as  follows: 

1.  One  of  the  salient  features  of  these  entries  -  and  of  the  scenarios  -  is  that  AM  is 

absolutely  forbidden  to  look  inside  them,  to  try  to  analyze  them.  They  are  opaque. 
Most  Intuition  functions  use  numbers  and  arithmetic,  and  it  would  be  pointless  to 
say  that  AM  discovered  such  concepts  if  it  had  access  to  those  algorithms  all  along. 

2.  The  second  characteristic  of  an  Intuition  is  that  it  be  fallible.  As  with  human 
intuition,  there  is  no  guarantee  that  what  is  suggested  will  be  verified  even 
empirically,  let  alone  formally.  Not  only  does  this  make  the  programming  of 
Intuition  functions  easier,  it  was  meant  to  provide  a  degree  of  "fairness"  to  them. 
AM  wasn’t  cheating  quite  as  much  if  the  See-saw  function  was  only  antisymmetric 
90%  of  the  time. 

3.  Nevertheless,  the  intuitions  are  very  suggestive.  Many  conjectures  can  be  proposed 

only  via  them.  Some  analogies  (see  the  next  subsection)  can  also  be  suggested  via 
common  intuitions. 

After  they  were  coded  and  running,  1  decided  that  the  intuition  functions  were  unfair;  they 
contained  some  major  discoveries  "built-in"  to  them.  They  had  the  power  to  propose 
otherwise-obscure  new  concepts  and  potential  relationships.  They  contributed  nothing  other 
than  what  was  originally  programmed  into  them;  they  were  not  synergetic.  Due  to  this 
dubious  character  of  the  contributions  by  AM’s  few  Intuition  functions,  they  were  removed 
from  the  system.  Ail  the  examples  and  all  the  discoveries  listed  in  this  document  were  made 
without  their  assistance. 

We  shall  now  drop  this  de-implemented  idea.  J  think  there  is  some  real  opportunity  for 
research  her’.  For  the  benefit  of  any  future  researchers  in  this  area,  let  me  point  to  the 
excellent  discussion  of  analogic  representations  in  [Sioman  71]. 
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5.2.6.  Analogies 


T he  whole  idea  of  analogy  is  that  'Effects’,  viewed  as  a  function  of  situation,  is  a 
continuous  function. 


—  Poincar*' 


As  with  Views  and  Intuitions,  this  facet  is  useful  for  shifting  between  one  part  of  the 
universe  and  another.  Views  dealt  with  transformations  between  two  specific  concepts; 
Intuitions  dealt  with  transformations  between  a  bunch  of  concepts  and  a  large  standard 
scenario  which  was  carefully  hand-crafted  In  advance.  In  contrast,  this  facet  deals  with 
transforming  between  a  list  of  concepts  and  another  list  of  concepts. 

I 

Analogies  operate  on  a  much  grander  scale  than  Views.  Rather  than  simply  transforming  a 
few  Isolated  items,  they  initiate  the  creation  of  many  new  concepts.  Unlike  Intuitions,  they 
are  not  limited  in  scope  beforehand,  nor  are  they  opaque.  They  are  dynamically  proposed. 

The  concept  of  "prime  numbers"  is  analogous  to  the  notion  of  "simple  groups".  While  not 
isomorphic,  you  might  guess  at  a  few  relationships  involving  simple  groups  just  by  my 
telling  you  this  fact:  simple  groups  are  to  groups  what  primes  are  to  numbers.^^ 

Let’s  take  3  elementary  examples,  involving  very  fundamental  concepts. 


1.  AM  was  told  how  to  View  a  set  as  if  it  were  a  bag. 

2.  AM  was  told  it  could  Intuit  the  relation  "i"  as  the  predetermined  "See-saw"  function. 

3.  AM,  by  ,'tself,  once  Analogized  that  these  two  lists  correspond: 

<Bags  Same-length  Operations-on-and-into  Bags> 

<Bags-of-T’s  Equality  Those  operations  restricted  to  Bags-of-T’s> 

The  concept  of  a  bag,  all  of  whose  elements  are  "T'”s,  is  the  unary  representation  of 
numbers  discovered  by  AM.  When  the  above  analogy  (*3)  is  first  proposed,  there  are  many 
known  Bag-operatlons^^  but  there  are  as  yet  no  numeric  operations^®.  This  triggers  one  of 
AM’s  heuristic  rules,  which  spurs  AM  on  to  finding  the  analogues  of  specific  Bag- 
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If  *  croup  i«  not  timple,  it  cm  be  fectored  Unfortunately,  the  factorization  of  a  group  Into  timole  groupt  it  not  unique. 

Another  analogizing  contact  For  each  prime  p,  we  can  aatoclate  the  cyclic  group  of  order  p,  which  te  of 
courae  eimple  AM  never  came  up  with  the  co.xept  of  aimple  groupt;  thia  ie  juet  an  illuttratisn  for  the 
eophieticatad  reader. 
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ie,  all  entriee  on  ln-dom-of<Bag)  and  ln-ran-of(Bag)i  a  few  of  theae  art.  Bag-ineert,  Bag-union,  Bag-interaection 
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operations.  That  is,  what  special  properties  do  the  bag-operations  have  when  their  domains 
and/or  ranges  are  restricted  from  Bags  to  Bags-of-T’s  (i.e,  Numbers).  In  this  way,  in  fact, 
AM  discovers  Addition  (by  restricting  Bag-union  to  the  Domain/range  <Bags-of-T’s  Bags- 
of-T’s  -»  Bags-of-T’s>),  plus  many  other  nice  arithmetic  functions. 

Well,  if  it  leads  to  the  discovery  of  Addition,  that  analogy  is  certainly  worth  having.  How 
would  an  analogy  like  that  be  proposed?  As  the  reader  might  expect  by  now,  the 
mechanism  is  simply  some  heuristic  rule  adding  it  as  an  entry  to  the  Analogies  facet  of  a 
certain  concept.  For  example: 

IF  th«  current  task  has  just  created  a  canonical  specialization  C2  of  concept  Cl,  with  respect 
to  operations  FI  and  F2,  [i.e.,  tv/o  members  of  C2  satisfy  FI  iff  they  satisfy 
F2], 

THEN  add  the  following  entr>  to  the  Analogies  facet  of  C2: 

<C1  Kl  Operations*on-and-lnto(CI)> 

<C2  F2  Those  operations  restricted  to  C2’s> 

After  generalizing  "Equality"  into  the  operation  "Same-length",  AM  seeks  to  find  a 

canonical^^  representation  for  Bags.  That  is,  AM  seeks  a  canonizing  function  f,  such  that 
(for  any  two  bags  x,y) 

Ssme-lengih(x,y}  iff  Equal(  f(x),  }(y}  ). 

Then  the  range  of  f  would  delineate  the  set  of  "canonical"  Bags.  AM  finds  such  an  f  and 
such  a  set  of  canonical  bags:  the  operation  f  involves  replacing  each  element  of  a  bag  by 
"T",  and  the  canonical  bags  are  those  whose  elements  are  all  T’s.  In  this  case,  the  above 
rule  triggers,  with  CI-Bags,  C2*‘Bags-of-T’s,  FUSame-length,  F2»Equality,  and  the  analogy 
which  is  produced  is  the  one  shown  as  example  #3  above. 

The  Analogy  facets  are  not  implemented  in  full  generality  in  the  existing  LISP  version  of 
AM,  and  for  that  reason  I  shall  refrain  from  delving  deeper  into  their  format.  Since  good 
research  has  already  been  done  on  reasoning  by  analogy^®,  I  did  not  view  it  as  a  central 
feature  of  my  work.  Very  little  space  will  be  devoted  to  it  in  this  document. 

An  important  type  of  analogy  which  was  untapped  by  AM  was  that  between  heuristics.  If 
two  situations  were  similar,  then  conceivably  the  heuristics  useful  in  one  situation  might  be 
useful  (or  have  useful  analogues)  in  the  new  situation.  Perhaps  this  is  a  viable  way  of 
enlarging  the  known  heuristics.  Such  "meta-level"  activities  were  kept  to  a  minimum 
throughout  AM,  and  this  proved  to  be  a  serious  limitation. 

Let  me  stress  that  the  failure  of  the  Intuitions  facets  to  be  nontrivial  was  due  to  the  lack  of 
spontaneity  which  they  possessed.  Analogies  facets  were  useful  and  "fair"  since  their  uses 
were  not  predetermined  by  the  author. 
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Basically,  facet  Conjee  of  concept  C  is  a  list  of  relationships  which  involve  C.  We  shall 
discuss  its  semantics  (uses  of  this  facet)  before  its  syntax. 


Perhaps  the  most  obvious  use  for  this  facet  would  be  to  hold  conjectures  which  could  not 
be  phrased  simply.  Yet  it  turns  out  that  luckily  (1  think),  all  the  conjectures  "fell  out" 
naturally  as  trivial  relationships,  e.g.  simply  as  arcs  in  the  Genl/Spec/Exs/Isas  pointer 
format.  Specifically,  the  modal  conjecture  had  the  form  "the  range  of  F  is  not  just  C,  but 
actually  S". 

For  example,  AM  restricted  TIMES  to  perfect  squares,  and  noted  that  the  result  was  not 
merely  a  number  but  a  perfect  square  each  time.  The  unique  factorization  theorem  was 
noticed  similarly  (the  range  of  Prime-factorings  was  always  a  singleton,  not  merely  a  set). 

!n  all  the  cases  encountered  by  AM,  there  was  never  any  real  need  for  a  place  to  "park"  an 
awkwardly-phrased  conjecture,  because  no  awkward  conjecture  could  ever  possibly  be  noticed. 
Why  is  this  so?  AM  was  constructed  explicitly  on  the  assumption  that  all  (enough?) 
important  theorems  could  be  discovered  in  quite  natural  ways,  as  very  simple  (already- 
known)  relationships  on  already-defined  concepts.  AM  embodies  several  such  assumptions 
about  math  research;  they  are  collected  and  packaged  for  display  In  Section  7.2.6,  on 
page  162. 


What  else  might  this  facet  be  useful  for,  if  not  the  storage  of  awkwardly-worded 
conjectures?  It  might  be  a  good  place  to  store  fiimsy  conjectures:  those  which  were  strong 
enough  to  get  considered,  yet  for  which  not  much  empirical  confirmation  had  been  done. 
This  in  fact  was  one  important  role  of  this  facet. 

For  example,  AM  was  initially  told  that  there  are  two  specializations  of  Unordered- 
structures,  namely  Bags  and  Sets.  But  AM  was  not  given  any  examples  of  any  structures  at 
all.  Early  on,  it  chose  the  task  "Fillin  examples  of  Bags"  from  the  agenda.  After  filling  them 
in,  a  heuristic  lule  had  AM  consider  whether  or  not  this  concept  of  Bags  was  really  any 
more  specialized  than  the  concept  of  Unordered-struclures.  To  test  this  empirically,  AM 
tried  to  verify  whether  or  not  there  were  any  examples  of  Unordered-structures  that  were 
not  examples  of  Bags.  Failure  to  find  any  led  to  proposing  the  conjecture  "All  Unordered- 
structures  are  really  Bags".  This  could  have  been  recorded  quite  easily:  Bags  was  already 
known  to  be  specialization  of  Unordered-structure,  so  all  AM  had  to  do  was  tag  it  as  a 
generalization  as  well  (add  "Bags"  to  the  Generalizations  facet  of  the  Unordered-structures 
concept).  But  a  heuristic  rule  which  knows  about  such  equivalence  conjectures  first  asked 
whether  there  were  any  specializations  of  Unordered-structures  which  had  no  known 
examples,  and  fur  which  AM  had  not  (recently,  at  least)  tried  to  fill  in  examples.  In  fact, 
such  an  entry  was  "Sets".  So  the  conjecture  was  stored  on  the  Conjee  facet  of  Unordered- 
structures,  and  a  new  job  was  added  to  the  agenda:  "Fill  in  examples  of  Sets".  The  reason 
was  that  such  examples  might  disprove  this  flimsy  conjecture.  In  fact,  the  job  already 
existed  on  the  agenda,  so  only  the  new  reason  was  added,  and  its  priority  Wcis  boosted. 
When  such  examples  were  round,  they  did  of  course  disprove  that  conjecture:  each  set  was 
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an  Unordered-structure  and  yet  was  not  a  Bag.^® 

This  last  example  has  suggested  another  use  for  this  facet:  holding  heuristic  rules  which  are 
relevant  to  filling  in  and  checking  conjectures.  For  example,  the  Conjee  facet  of  Operations 
has  some  special  heuristics  which  look  for  certain  kinds  of  relationships  involving  any 
given  operation  (c.g.,  "Pick  any  example  F(x)»y.  See  what  interesting  statements  can  be 
made  about  y.  Then  try  to  verify  or  disprove  each  one  by  looking  at  the  values  of  ail  the 
other  known  calls  on  operation  F").  The  Conjee  facet  of  Any-concept  will  contain  knowledge 
which  is  much  more  general  in  scope  (e.g.,  "See  whether  concept  C  is  an  example  of  some 
member  of  (C.Isa).Spec").  Compose.Conjec  will  contain  more  specific  heuristics  ^.g.,  "See  if 
the  composition  AoB  is  really  no  different  from  B"). 

Given  any  concept  C,  AM  will  ripple  upwards,  locating  Isas(C),  and  collect  the  heuristics 
which  are  tacked  onto  their  Conjee  facets.  These  heuristic  rules  will  then  be  evaluated  (in 
order  of  increasing  generality),  and  some  conjectures  will  probably  be  proposed,  checked, 
discarded,  modified,  etc.  In  fact,  each  Conjee  facet  of  each  concept  can  have  two  separate 
subfacets:  Conjec.Fillin  and  Conjec.Check.  The  former  contains  heuristics  for  noticing 
conjectures,  the  second  for  verifying  and  patching  them  up. 

There  is  yet  another  use  for  this  facet,  one  of  efficiency  of  storage.  After  discovering  that 
ail  primes  except  2  are  Odd-primes,  there  is  very  little  reason  to  keep  around  Odd-primes 
as  a  separate  concept  from  Primes.  Yet  they  are  not  quite  equivalent.  Primes.Conjec  is  a 
good  place  for  AM  to  store  the  conjecture  "Primc(x)  implies  that  x-2  or  Odd(x)",  and  to 
pull  over  to  Primes  any  efficient  definition/algorithm  which  Odd-primes  might  possess 
(patching  it  up  to  work  for  "2").  and  then  destroy  the  concept  Odd-primes.  Another  way 
out  is  merely  to  destroy  "Primes",  and  make  2  a  distinguished  number  tacked  onto  the  Just- 
barcly-missed  subfacet  of  Odd-primes.Exs  (just  like  "1"  is  already). 

Here  is  another  example:  AM  discovers  that  Set-insertoSet-insert  is  the  same  as  just  Set- 
insert.  That  is,  if  you  insert  x  twice  into  a  set  S,  it’s  no  different  than  inserting  it  just  once 
(because  Sets  don’t  allow  multiple  copies  of  the  same  element).  Then  there’s  no  longer  any 
reason  for  keeping  Set-insertoSet-insert  hanging  around  as  a  separate  concept.  Instead,  just 
add  a  small  new  entry  to  Set-insert.Conjec  and  forget  that  space-consuming  composition 
forever. 

There  is  another  use  of  the  Conjee  facet:  untangling  paradoxes.  It  is  with  no  sorrow  that  I 
mention  that  this  facility  was  never  needed  by  AM:  no  genuine  contradictions  ever  were 
believed  by  AM.  What  would  one  look  like?  Suppose  a  chain  of  Spec  links  indicates  that  X 
is  a  specialization  of  Y,  and  yet  AM  finds  some  example  x  of  X  which  does  not  satisfy 
Y.Definition).  So  X  is  -  and  is  not  -  a  specialization  of  Y.  In  such  cases,  the  Conjees 
facets  of  the  concepts  involved  would  indicate  which  of  those  Spec  links  were  initially- 
supplied  (hence  unchallengabie),  which  links  were  created  based  on  formal  verifications 
(barely  challengable),  and  which  links  were  established  based  only  on  empirical  evidence 
(yes,  these  are  the  ones  which  would  then  fade  into  the  sunset).  If  it  has  to,  AM  should  be 
able-  to  recall  the  justification  for  each  new  link  it  created.  AM  can  deduce  this  by 
examining  the  Conjee  facets  of  the  concepts  involved. 
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Periodically  (at  huge  intervals)  AM  chose  a  task  of  the  form  "Check  conjees  about  C",  at 
which  time  all  the  entries  on  C.Conjec  would  be  re-examined  in  light  of  existing  data.  Some 
would  be  discarded  (perhaps  causing  some  Exs/Isa/Spec/Genl  links  to  vanish  with  them). 
Some  of  the  conjectures  might  be  believed  much  more  strongly  now  (causing  some  new  links 
to  be  recorded).  This  turned  out  to  be  a  surprisingly  ineffective  activity;  very  few  new 
revelations  were  obtained  this  way.  Ultimately,  this  kind  of  task  was  muzzled  (AM  was 
inhibited  from  doing  this). 

Theoretically,  AM  might  possess  rules  which  transformed  a  conjecture  into  a  more  efficient 
algorithm  for  an  operation,  or  which  used  the  knowledge  contained  therein  to  speed  up  an 
existing  algorithm.  Another  sophisticated  use  of  a  conjee  would  be  to  set  up  a  new 
representation  scheme  for  a  concept^®. 

Finally,  the  Conjee’s  facet  is  used  as  a  showcase,  to  highlight  some  nice  discovery  that  AM 
wants  to  display.  The  user  can  look  at  the  entries  on  each  concept’s  Conjee  facet  (after  a 
long  run)  and  get  a  better  feeling  for  AM’s  abilities.  If  there  are  several  powerful 
conjectures  listed  for  concept  C,  then  it  appears  to  the  user  that  AM  "understands"  the 
concept  much  better  than  if  C.Conjecs  is  empty. 

Let’s  recapitulate  the  uses  of  this  facet: 

1.  Store  awkwardly-phrased  conjectures:  this  wasn’t  really  useful. 

2.  Store  flimsy  conjectures:  apparent  relationships  worth  remembering,  yet  not  quite 
believed. 

3.  Hold  heuristics  which  notice  and  check  conjectures. 

4.  Obviate  the  need  for  many  similar  concepts:  Collapse  the  entire  essence  of  a  related 

concept  into  one  or  two  relationships  involving  this  one. 

5.  Untangling  paradoxes:  a  historic  record,  which  wasn’t  really  used. 

6.  Improve  existing  algorithms,  definition  testing  procedures,  representations. 

7.  Display  AM’s  most  impressive  observed  relationships  in  a  form  which  is  easily 
inspectable  by  the  user. 

The  syntax  of  this  facet  is  simply  a  list  of  conjectures,  where  each  conjecture  has  the  form 
of  a  relationship:  (R  a  b  c...d).  R  is  the  name  of  a  known  operation  (in  which  case,  abc...  are 
its  arguments  and  we  claim  that  d  is  its  value),  or  R  is  a  predicate  (and  d  is  either  True  or 
False),  or  R  is  the  name  of  a  kind  of  link  (GenI,  Spec,  Isa,  or  Exs),  and  the  claim  is  that  a 
and  b  arc  related  by  R.  Here  are  three  example  of  conjectures,  illustrating  the  possible 
formats: 

1.  (Compos#  Set-inseri  S«t”insert  Set-insert).  This  says  that  if  you  apply  the  known 
operation  Compose,  to  the  two  arguments  Set-insert  and  Set-insert,  then  the 
resultant  composition  is  indistinguishable  from  Set-insert. 

2.  (Same-size  lnsert(S,S)  S  False).  That  is,  inserting  a  set  into  itself  will  always  (for  finite 

sets)  give  you  a  set  of  a  different  length. 

3.  (Example-of  Prime-faclorinta  Function).  This  conjecture  is  the  unique  factorization 
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thcorem.  The  operation  which  takes  a  number  n,  and  finds  ail  prime  factorizations 
of  n,  is  claimed  to  be  a  function,  not  merely  a  relation.  That  is,  each  n  has 
precisely  one  such  prime  factoring. 


5.2.8.  Definitions 

A  typical  way  to  disambiguate  a  concept  from  all  others  is  to  provide  a  "definition"  for  it.^' 
Almost  every  concept  had  some  entries  initially  supplied  on  its  "Definitions"  facet.  The 
format  of  this  facet  is  a  list  of  entries,  each  one  describing  a  separate  definition.  A  single 
entry  will  have  the  following  parts: 

1.  Descriptors:  Recursive/\inear/Iterative,  Quick/Slow,  Opaque/Transparent,  Once- 
only/Early/Late,  Destructive/Nondestructive. 

2.  Relators:  Reducing  to  the  definition  of  concept  X,  Same  as  Y  except....  Specialized 

version  of  Z,  Using  the  definition  of  W,  etc. 

3.  Predicate:  A  small,  executable  piece  of  LISP  code,  to  tell  if  any  given  item  is  an 

example  of  this  concept. 

The  predicate  or  "code"  part  of  the  entry  must  be  faithfully  described  by  the  Descriptors, 
must  be  related  to  other  concepts  just  as  the  Relators  claim.  The  predicate  must  be  a  LISP 
function  which  take  argument(s)  and  return  either  T  or  NIL  (for  True/False),  depending  on 
whether  or  not  the  argument(s)  can  be  regarded  as  examples  of  the  concept. 

The  argument  "{A  B}"  should  satisfy  the  predicate  of  any  valid  definition  entry  of  the  Sets 
concept.  This  triple  of  arguments  <{A  B},  {A  C},  {A  B  C}>  should  satisfy  any  definition  of 
the  Set-union  concept,  since  the  third  is  equal  to  the  Set-union  of  the  first  two  arguments. 

Here  is  a  typical  entry  from  the  Definitions  facet  of  the  Set-union  concept: 


Descriptors:  Slow,  Recursivt,  Transparent 

Relators:  Uses  the  algorithm  for  Set-insert,  Uses  the  definition  of  Empty-set, 

Uses  the  definition  of  Set-equal,  Uses  the  algorithm  for  Some-member, 
Uses  the  algorithm  for  Set-delete,  Uses  the  definition  of  Set-union 

Code:  X  (A  B  C) 

IF  Empty-sei.Defn(A)  THEN  Set-equsl.Defn(B,C}  ELSE 
X  Some-member.Atg(A) 

A  Set-delete.Alg(X,A) 

B  $et-insert.Alg(X,8) 

Set-union.Defn(A,B,C) 
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Let  me  stress  that  this  is  just  one  entry,  from  one  facet  of  one  concept. 

The  notation  "X  ♦-  Some-member.Alg{A)"  means  that  any  one  algorithm  for  the  concept 
Some-member  should  be  accessed,  and  then  it  should  be  run  on  the  argument  A.  The  result, 
which  will  be  an  element  of  A,  is  to  be  assigned  the  name  "X".  The  effect  is  to  bind  the 
variable  X  to  some  member  of  set  A. 

In  the  actual  LISP  implementation,  the  ELSE  part  of  the  conditional  is  really  coded^^  as: 

(Set-union. Defn  (Set-delete. Alg  (SETQ  X  (Some-member. Alg  A) )  A) 
(Set-insert. Alg  X  S) 

C 

) 

This  particular  definition  is  not  very  efficient,  but  it  is  described  as  Transparent.  That 
means  it  is  very  well  suited  to  analysis  and  modification  by  AM  itself.  Suppose  some 
heuristic  rule  wants  to  gerieralize  this  definition.  It  can  peer  inside  it,  and,  e.g.,  replace  ihe 
base  step  call  on  Set-equal,  by  a  call  on  a  generalization  of  Set-equal  (say  "Same-length"®^). 

How  could  different  definitions  help  here?  Suppose  there  were  a  definition  which  first 
checked  to  see  if  the  three  arguments  were  Set-equal  to  each  other,  and  if  so  then  it 
instantly  returned  T  as  the  value  of  the  definition  predicate;  otherwise,  it  recurred  into  Set- 
union.Defn  again.  This  might  be  a  good  algorithm  to  try  at  the  very  beginning,  but  if  the 
Equality  test  fails,  we  don't  want  to  keep  recurring  into  this  definition.  This  algorithm 
should  thus  have  a  descriptor  labelling  it  ONCE-ONLY  EARLY. 

A  typical  kind  of  entry  for  the  Definitions  facet  of  an  operation  is  to  simply  call  on  the 
Algorithm:  part  of  that  same  concept.  Here  is  such  an  entry  from  the  Definitions  facet  of  the 
Set-union  concept: 


Oascripicrs:  non# 

Ralators:  Uses  (he  definition  of  Set^equel,  Uses  the  altorithm  for  Set-union 
Code:  X  (A  B  C)  Set-equ8l.Defn(C,  Set-union.Alg(A,B)) 


This  definition  is  a  trivial  call  on  the  "Algorithms"  facet  of  Set-union.  That  is,  one  way  to 
test  whether  C  is  the  set-union  of  A  and  B,  is  simply  to  run  set-union  on  A  and  B,  and 
compare  the  result  against  C.  The  descriptors  and  relators  of  the  particular  algorithm 
which  is  chosen  will  then  be  added  to  the  descriptors  and  relators  which  exist  so  far  on  this 
entry.  Note  that  the  box  above  (like  the  box  on  the  previous  page)  is  simply  one  entry  on 
the  Definitions  facet  of  the  Set-union  concept. 
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There  are  three  purposes  to  having  descriptors  and  relators  hanging  around: 

1.  For  the  benefit  of  the  user.  AM  appears  more  intelligent  because  it  can  describe  the 

kind  of  definition  it  is  using  -  and  why. 

2.  For  the  sake  of  efficiency.  When  all  AM  wants  to  do  is  to  evaluate  Set-union(A,B)i 

it’s  best  just  to  grab  a  fast  definition.  When  trying  to  generalize  Set-union,  it’s 
more  appropriate  to  modify  a  very  clean,  transparent  definition  -  even  if  it  is  a 
slow  one. 

3.  For  the  benefit  of  the  heuristic  rules.  Often,  a  left-  or  a  right-hand-side  will  ask 

about  a  certain  kind  of  definition.  For  example,  "If  a  (ransparant  dafinition  of  X 
axists,  than  try  to  spaeializa  X". 

Granted  that  Descriptors  and  Relators  are  useful,  how  do  these  "meta-level"  modifiers  get 
filled  In,  for  newly-created^’’  concepts?  All  such  powers  are  embedded  in  the  fine  structure 
of  the  heuristic  rules.  This  is  true  for  the  Algorithms  facet  as  well,  and  will  be  illustrated  in 
the  very  next  subsection. 

Let  me  pull  back  the  curtain  a  little  further,  and  expose  the  actual  implementation  of  these 
ideas  in  AM.  The  secrets  about  to  be  revealed  will  not  be  acknowledged  anywhere  else  in 
this  document.  They  may,  however,  be  of  Interest  to  future  researchers.  Each  concept  may 
have  a  cluster  of  Definition  facets.  Just  as  it  can  have  several  kinds  of  Examples  facets. 
These  include  three  types:  Necessary  and  sufficient  definitions,  necessary  definitions,  and 
sufficient  definitions.  These  three  types  have  the  usual  mathematical  meanings.  All  that 
has  been  alluded  to  before  (and  after  this  subsection)  is  the  necc&suff  type  of  definition  (x  is 
an  example  of  C  if  and  only  if  x  satisfies  C.Def/necc&suff).  Often,  however,  there  will  be  a 
much  quicker  sufficient  definition  (x  satisfies  C.Def/suf,  only  if  x  is  certainly  a  C).  Similarly, 
entries  on  C.Def/nec  are  useful  for  quickly  checking  that  x  Is  not  an  example  of  C  (to  check 
this,  it  suffices  to  verify  that  x  fails  to  satisfy  a  necessary  definition  of  C), 

So  given  the  task  of  deciding  whether  or  not  x  is  an  example  of  C,  we  have  many 
alternatives: 

1.  If  X  Is  a  concept,  see  if  C  is  a  member  of  x.ISA  (if  so,  then  x  Is  an  example  of  C). 

2.  Try  to  locate  x  within  C.Exs.  (depending  upon  the  flavor  of  subfacet  on  which  x  is 

found,  this  may  show  that  x  is  or  is  not  an  example  of  C). 

3.  If  X  is  a  concept,  ripple  to  collect  ISA’s(x),  and  see  if  C  is  a  member  of  ISA’s(x). 

4.  If  there  is  a  fast  sufficent  definition  of  C,  see  if  x  satisfies  it. 

5.  If  there  is  a  fast  necessary  definition  of  C,  see  if  x  fails  it  (if  so,  then  x  is  not  an 

example  of  C). 

6.  If  there  is  a  necessary  and  sufficient  definition  of  C,  see  whether  or  not  x  satisfies 

that  definition  (this  may  show  that  x  is  or  is  not  an  example  of  C). 

7.  Try  to  locate  x  within  C.Exs.  (depending  upon  the  flavor  of  subfacet  on  which  x  is 

found,  this  may  show  that  x  is  or  is  not  an  example  of  C). 

8.  Recur:  check  to  see  if  x  is  an  example  of  any  specialization  of  C. 

9.  Recur:  check  to  see  if  x  is  not  an  example  of  some  generalization  of  C  (if  so,  then  x 

is  not  an  example  of  C), 

In  fact,  there  is  a  LISP  function,  IS-EXAMPLE,  which  performs  those  steps  in  that  order. 
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At  each  moment,  there  is  a  timer  set,  so  even  if  there  is  a  necessary  and  sufficient  definition 
hanging  around,  it  might  run  out  of  time  before  settling  the  issue  one  way  or  the  other. 
Each  time  the  function  recurs,  the  timer  is  granted  a  smaller  and  smaller  quantum,  until 
finally  it  has  too  little  to  bother  recurring  anymore.  There  is  a  potential  overlap  of  activity: 
to  see  if  x  is  an  example  of  C,  the  function  might  ask  whether  x  is  or  is  not  an  example  of 
a  particular  generalization  of  C  (step  9,  above);  to  test  that,  AM  might  get  to  step  8,  and 
again  ask  if  x  is  an  example  of  C.  Even  though  the  timer  would  eventually  terminate  this 
fiasco  (and  even  though  the  true  answer  might  be  found  despite  this  wasted  effort)  it  is  not 
overly  smart  of  AM  to  fall  into  this  loop.  Therefore,  a  stack  is  maintained,  of  ail  concepts 
whose  definitions  the  IS-EX AMPLE  function  tried  to  test  on  argument  x.  As  the  function 
recurs,  it  adds  the  current  value  of  C  to  that  stack;  this  value  gets  removed  when  the 
recursion  pops  back  to  this  level,  when  that  recursive  call  "returns"  a  value. 


5.2.9.  Algorithms 

Earlier,  we  said  that  each  concept  can  have  any  facets  from  the  universal  fixed  set  of  25 
facets.  This  is  not  strictly  true.  Sometimes,  a  whole  class  of  concepts  will  possess  a  certain 
type  of  facet  which  no  others  may  meaningfully  have.  If  C  can  have  that  facet,  then  so  can 
any  specialization  of  C.  Typically,  there  will  be  some  concept  O  such  that  the  examples  of 
C  are  precisely  the  set  of  concepts  which  can  possess  the  new  facet.  That  is,  there  wilt  be  a 
domain  of  applicability  for  the  facet,  just  as  we  defined  such  domains  of  applicability  for 
heuristics.  For  example,  consider  the  "Domain/Range"  facet.  It  is  rneaningful  only  to 
"operations",  but  really  is  an  important  feature  of  all  operations.  Its  domain  of  applicability 
is  Operation. 

The  kinds  of  facets  -  including  all  such  limited  "Jargon"  facets  -  is  fixed  once  and  for  all. 
New  kinds  of  facets  cannot  be  conceived  and  added  by  AM  Itself.  Nor  does  AM  have  any 
control  over  the  domain  of  applicability  of  each  facet. 

If  desired,  one  can  view  all  this  in  a  more  general  light.  For  each  facet  f,  the  only  concepts 
which  can  have  entries  for  facet  f  are  examples  of  some  particular  concept  J(f)  -  the  "J" 
stands  for  "Jargon".  J(f)  is  the  domain  of  applicability  of  facet  f.  If  C  is  any  concept  which 
is  not  an  example  of  J(f),  then  it  can  never  meaningfully  possess  any  entries  for  that  facet  f. 
For  almost  all  facets  f,  J(f)  is  "Any-concept".  Thus  any  concept  can  possess  almost  any  facet. 
For  example,  J(Defn)-"Any-concept",  so  any  concept  may  have  definitions. 

There  are  a  few  more  restricted  facets.  For  example,  J(Domain/range)-"Operation".  So  only 
operations  can  have  domain/range  facets.®®  The  concept  "Sets",  which  is  not  an  operation, 
can’t  have  a  domain/range  facet. 

Similarly,  J(Algorithms)="Actives".  This  facet  is  the  subject  of  this  section.  The  Algorithms 
facet  is  present  for  ail  -  but  only  for  -  Actives  (predicates,  relations,  operations). 

The  representation  is,  as  usual,  a  list  of  entries,  each  one  describing  a  separate  algorithm  n 
single  entry  will  have  the  following  parts: 
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!.  Descriptors:  Recursive/Linear/lterative,  Quick/Slow,  Opaque/Transparent, 
Once-only/Early/Late,  Destructive/Nondcstructive. 

2.  Relators:  Reducing  to  the  algorithm  for  concept  X,  Same  as  Y  except...,  Specialized 

version  of  Z’s  algorithm,  Using  the  algorithm  for  W,  etc. 

3.  Program:  A  small,  executable  piece  of  LISP  code,  for  actually  running  C. 

Note  the  similarity  to  the  format  for  the  Definitions  facets  of  concepts.  Instead  of  a  LISP 
predicate,  however,  the  Algorithms  facets  possess  a  LISP  function  (an  executable  piece  of 
code  whose  value  will  in  general  be  other  than  True/False).  That  "program"  part  of  the 
entry  must  be  faithfully  described  by  the  Descriptors,  must  be  related  to  other  concepts  just 
as  the  Relators  claim,  must  take  argunents  and  return  values  as  specified  in  the 
Domain/Range  facet  of  C,  and  when  run  on  any  arguments,  the  resultant  <args  value>  pair 
must  satisfy  the  Definitions  facet  of  C. 

There  is  an  extra  level  of  sophistication  which  is  available  but  rarely  used  in  AM.  The 
descriptors  can  themselves  be  small  numeric-valued  functions.  For  example,  instead  of  just 
including  the  Descriptor  "Quick",  and  instead  of  just  giving  a  fixed  number  for  the  speed  of 
the  algorithm,  there  might  be  a  little  program  there,  which  looked  at  the  arguments  fed  to 
the  algorithm,  and  then  estimated  how  fast  this  algorithm  would  be.  The  main  reason  for 
not  usir."  this  feature  more  heavily  is  that  most  of  the  algorithms  are  fairly  fast,  and  fairly 
constant  In  performance.  It  would  be  silly  to  spend  much  time  recomputing  their  efficiency 
each  time  they  were  called.  If  the  algorithm  is  recursive,  this  conjures  up  even  sillier 
pictures.  The  main  reason  in  support  of  using  this  feature  is  of  course  "intelligence":  in  the 
long  run,  processing  a  little  bit  before  deciding  which  algorithm  to  run  has  to  be  the 
winning  solution.  At  the  moment,  it  is  not  yet  cost-effective. 

Here  is  a  typical  entry  from  the  Algorithms^®  facet  of  the  Set-union  concept: 


Doscriptorc:  Slow,  Recursive,  Transparent 

Relators:  Uses  the  altorithm  for  Set-insert,  Uses  the  definition  of  Empty-set, 

Uses  the  algorithm  for  Some-member,  Uses  the  algorithm  for  Set-insert, 
Uses  the  algorithm  for  Set-union 

Code:  k  (A  B) 

IF  Empty-set.Defn(A)  THEN  B  ELSE 
X  *■  Some-member.Alg(A) 

A  Set-delete.Alg(X,A) 

B  *•  Set-insert.Alg(X,B} 

Set-union.Alg(A,B) 
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Note  that  the  Descriptors  don't  say  whether  this  algorithm  is  destructive^^  or  not.  That 
means  that  this  same  algorithm  can  be  used  either  destructively  or  not,  depending  on  what 
AM  wants.  More  precisely,  it’s  up  to  the  algorithms  which  get  called  on  by  this  one.  !f  they 
ar6  all  chosen  to  be  destructive,  so  will  Set-union.  If  they  all  copy  their  arguments  first  then 
Set-union  will  not  be  destructive.  For  example,  note  how  the  algorithm  calls  on  Set- 
insert(X,B).  If  this  is  destructive,  then  at  the  end  B  will  have  been  physically  modified  to 
contain  X;  the  original  contents  of  B  will  be  lost. 

This  particular  algorithm  is  not  very  efficient,  but  it  is  described  as  Transparent.  7'hat 
means  it  is  very  well  suited  to  analysis  and  modification  by  AM  itself.  Suppose  some 
heuristic  rule  wants  to  specialize  this  algorithm.  It  can  peer  inside  it,  and,  e.g.,  replace  the 
variable  X  in  (Set-insert  X  B)  by  the  constant  "T".®* 

Why  should  AM  bother  storing  multiple  algorithms  for  the  same  concept?  Consider  this 
example  again,  of  Set-union.  Suppose  there  were  an  algorithm  which  first  checked  to  see  if 
the  two  arguments  were  Equal  to  each  other,  and  if  so  then  it  instantly  returned  one  of 
them  as  the  final  value  for  Set-union;  otherwise,  it  recurred  into  Set-union.Alg.  This  might 
be  a  good  algorithm  to  try  at  the  very  beginning,  but  if  the  Equality  test  fails,  we  don’t  want 
to  keep  recurring  into  this  definition.  This  algorithm  should  thus  have  a  descriptor 
labelling  it  ONCE-ONLY  EARLY. 

Also,  there  is  an  Iterative  algorithm  which  checks  to  see  if  A  equals  B,  and  if  so  then  it 
returns  B.  If  not,  the  algorithm  proceeds  to  check  that  A  is  shorter  than  B,  and  if  not  it 
switches  them.  Finally,  it  enters  an  iterative  loop  similar  to  the  recursive  one  above:  it 
repeatedly  transfers  an  element  from  A  to  B,  using  Some-member,  Set-delete  and  Set-insert. 
Tnis  iterative  loop  repeats  until  A  becomes  empty.  While  more  efficient  than  the  recursive 
one,  this  definition  is  less  transparent. 

An  even  more  efficient  algorithm  is  provided,  but  it  is  totally  opaque: 


Dttcriplors:  Quick,  Non-r«curtiv*,  Non-destructive,  Opaque 
Relators:  none 
Code:\(AB)  (UNION  A  B) 


This  algorithm  calls  on  the  LISP  function  "UNION"  to  perform  the  set-union.  It  is  the 
"best"  algorithm  to  choose  unless  space  is  critical,  in  which  case  a  destructive  algorithm  must 
be  chosen,  or  unless  AM  wishes  to  inspect  it  rather  than  run  it,  in  which  case  a  transparent 
one  must  be  picked. 
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All  the  details  about  understanding  the  descriptors  and  relators  are  embedded  in  the  fine 
structure  of  the  heuristic  rules.  A  left-hand-side  may  test  whether  a  certain  kind  of 
algorithm  exists  for  a  given  concept.  A  right-hand-side  which  fills  in  a  new  algorithm  must 
also  worry  about  filling  in  the  appropriate  descriptors  and  relators.  As  with  newly  created 
concepts,  such  information  is  trivial  to  fill  in  at  the  time  of  creation,  but  becomes  much 
harder  after  the  fact. 

Here  is  a  typical  heuristic  rule  which  results  in  a  new  entry  being  added  to  the  Algorithms 
facet  of  the  newly-created  concept  named  Compose-Set-Intersect&Set-Intersect: 

IF  the  task  is  to  Fillin  Algorithms  for  F, 
and  F  is  an  oxample  of  Composition 
and  F  has  a  definition  of  the  form  FiGoH, 
and  F  has  no  transparent,  nonreeursive  algorithm, 

THEN  add  a  new  entry  to  the  Algorithms  facet  of  F, 
with  Descriptors:  Transparent,  Non-recursive 

with  Relators:  Reducing  to  G.AIg  and  H.Aig,  Using  the  Definition  of  <G.Domain> 
with  Program:  X  (||<G.Domain>||,||<H.Domain>||-l,X) 

(SETQ  X  (H.AIg  ||<G.Domain>||)) 

(AND 

(<G.Oomain>.Defn  X) 

(G.AIg  X  ||<H.Domain>||-l)) 

The  intent  of  the  little  program  which  gets  created  is  to  apply  the  first  operator,  check  that 
the  result  is  in  the  domain  of  the  second,  and  then  apply  the  second  operator.  The 
expression  ||<G.Domain>il  means  find  a  domain/range  entry  for  G,  count  how  many  domain 
components  there  are,  and  form  a  list  that  long  from  randomly-chosen  variable  names 
(u,v,w,x,y,z). 

For  the  case  mentioned  above,  F  ■  Compose-Set-Intersect&Set-Intersect,  G  ■  Set-Intersect, 
and  H  -  Set-Intersect.  The  domain  of  G  is  a  pair  of  Sets,  so  ||<G.Domain>||  is  a  list  of  2 
variables,  say  (u  v).  Similarly,  !!<H.Domain>||-l  is  a  list  of  1  variable,  say  (w).  Putting  all 
this  together,  we  see  that  the  new  definition  entry  created  for  Compose-Set-Intersect&Set- 
Intersect  would  look  like  this: 


Descriptors:  Non*Recursiv«,  Transparent 

Relators:  Reducing  to  Set-lntersect.Alg,  Using  the  definition  of  Sets 

Code:  X  (u,v,w,X) 

tSETQ  X  (Set-lntersect.Alg  u  v)) 

(AND 

(Sets.Defn  X) 

(Set-lntersect.Alg  X  w) 


Let  me  make  clear  here  one  "kluge"  of  the  AM  program.  At  times,  AM  will  be  capable  of 
producing  only  a  slow  algorithm  for  some  new  concept  C.  For  example,  TIMES‘'(x)  was 
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originally  defined  by  AM  as  a  blind,  exhaustive  search  for  bags  of  numbers  whose  product 
is  X.  As  AM  uses  that  algorithm  more  and  more,  AM  records  how  slow  it  is.  Eventually,  a 
task  is  selected  of  the  form  "Fillin  new  algorithms  for  C",  with  the  two  reasons  being  that  the 
existing  algorithms  are  ail  too  slow,  and  they  are  used  frequently.  At  this  point,  AM  should 
draw  on  a  body  of  rules  which  take  a  declarative  definition  and  transform  it  into  an 
efficient  algorithm,  or  which  take  an  inefficient  algorithm  and  speed  it  up.  Doing  a  good  job 
on  Just  those  rules  would  be  a  mammoth  undertaking,  and  the  author  decided  to  omit  them. 
Instead,  the  system  will  occasionally  beg  the  user  for  a  better  (albeit  opaque)  algorithm  for 
some  particular  operation.  In  general,  the  only  requests  were  for  inverse  operations,  and 
even  then  only  a  few  of  them.  The  reader  who  wishes  to  know  more  about  rules  for 
creating  and  improving  LISF  algorithms  is  directed  to  [Darlington  and  Burstall  73].  A 
more  general  discussion  of  the  principles  involved  can  be  found  in  [Simon  721 


5.2.10.  Domain/Range 

Another  facet  possessed  only  by  active  concepts  is  Domain/Range.  The  syntax  of  this  facet 
is  quite  simple.  It  is  a  list  of  entries,  each  of  the  form  <  D|  D2...  -♦  R  >,  where  there  can  be 

any  number  of  Dj’s  preceding  the  arrow,  and  R  and  all  the  Dj’s  are  the  names  of  concepts. 

Semantically,  this  entry  means  that  the  active  concept  may  be  run  on  a  list  of  arguments 
where  the  first  one  is  an  example  of  D|,  the  second  an  example  of  D2,  etc.,  and  in  that  case 

will  return  a  value  guaranteed  to  be  an  example  of  R.  In  other  words,  the  concept  may  be 
considered  a  relation  on  the  cross-product  DjxD2X...xR.  We  shall  say  that  the  domain  of 

the  concept  is  D|XD2X...,  and  that  its  range  is  R.  Each  Dj  is  called  a  component  of  the 
domain. 

For  example,  here  is  what  the  Domain/Range  facet  of  TIMES  might  look  like: 

{ 

<  Numbers  Numbers  -*  Numbers  > 

<  Odd-numbers  Odd-numbers  -♦  Odd-numbers  > 

<  Even-Numbers  Even-Numbers  -»  Even-numbers  > 

<  Odd-numbers  Even-Numbers  -♦  Even-Numbers  > 

<  Perf-Squares  Perf-$quares  -*  Perf-Squares  > 

<  Bags-of-Numbers  -♦  Numbers  > 

} 


Here  is  what  the  Domain/Range  facet  of  Set-Union  might  look  like: 


[ 

<  Sets  Sets  -♦  Sets  > 

<  Nonempty-sets  Sets  -♦  Non-empty-sets  > 

<  Sets-of-Sets  -4  Sets  > 

} 
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The  Domain/Range  part  is  useful  for  pruning  away  absurd  compositions,  and  for 
syntactically  suggesting  compositions  and  "coalescings".  Let’s  see  what  this  means. 

Suppose  some  rule  sometime  tried  to  compose  TIMESoSet-union.  A  rule  tacked  onto 
Compose  says  to  ensure  that  the  range  of  Set-union  at  least  intersects  (and  preferably  is 
equal  to)  some  component  of  the  domain  of  TIMES.  But  there  are  no  entities  which  are 
both  sets  and  numbers^®:  e^go  this  fails  almost  instantaneously. 

This  is  too  bad,  since  there  was  probably  a  good  reason  (e.g.,  intuition)  for  trying  this 
composition.  If  the  activation  energy  (priority  of  the  current  task)  is  high  enough,  AM  will 
continue  trying  to  force  it  through.  The  failure  arose  because  Sets  could  not  be  viewed  as  if 
they  were  Numbers.  A  relevant  rule  says: 

IF  you  want  to  view  X’s  as  if  they  w-'.'e  Y’s, 

THEN  seek  an  interesting  operation  F  from  X  to  Y,  to  do  the  viewing. 

Sc  AM  had  to  locate  any  and  all  operations  whose  domain/range  had  an  entry  of  the  form 
<Sets-*Numbers>.  The  only  such  operation  known  to  AM  at  the  time  was  F«Lcngtli.  So 
the  composition  produced  was  TIMES[X,  Length(Set-union(Y,Z))]. 

Notice  that  if  the  composition  Set-unionoSet-union  is  proposed,  there  will  be  no  conflict, 
since  the  range  of  Set-union  obviously  intersects  one  component  of  the  domain  of  Set-union. 
How  can  AM  determine  the  domain/range  of  this  composition?  A  rule  tacked  onto  Compose 
indicates  that  if  F»GoH,  and  a  domain/range  entry  for  C  is  <A...X...B  -»  C>,  and  an  entry 
for  H  is  <D...E  -*  Y>,  and  Y  intersects  X,  then  an  entry  for  F’s  domain/range  is  <A...D...E...B 
-*  C>.  That  is,  the  domain  of  H  is  substituted  for  the  single  component  of  the  domain  of  G 
which  can  be  shown  to  intersect  the  range  of  H.  Purely  syntactically,  AM  can  thus  compute 
some  domain/range  entries  for  the  composition  Set-unionoSet-union. 

<  Sets  Sets  -*  Sets>  and  <  Sets  Sets  -*  Sets)  combine  to  yield  <  Sets  Sets  Sets  -*  Sets  >; 
CNon-empty-sets  Sets  -♦  Non-empty-sets)  and  <Sot8  Sets  -»  Sets)  combine  to  yield 

<Non-empty-sets  Sets  Sets  -»  Non-empty-sets); 

and  so  on.  Similarly,  one  can  compute  an  entry  for  the  domain/range  facet  of  the  previous 
composition  of  three  operations  TlMESoLengthoSet-union: 

<  Sets  Sets  Sets),  <  Sets  Numbers),  and  <  Numbers  Numbers  -*  Numbers  >  combine  to 

yield  <  Numbers  Sets  Sets  Numbers  ) 

So  when  computing  TIMES(  X,  Length(  Set-union(Y,Z))),  both  Y  and  Z  can  be  sets,  and  X 
a  number,  and  the  result  will  be  a  number. 

The  claim  was  also  made  that  Domain/Range  facets  help  propose  plausible  coalescings.  By 
"coalescing"  an  operation,  we  mean  defining  a  new  one,  which  differs  from  the  original  one 
in  that  a  couple  of  the  arguments  must  now  coincide.  For  example,  coalescing  TIMES(x,y) 
results  in  the  new  operation  F(x)  defined  as  TIMES(x,x).  Syntactically,  we  can  coa'^^ce  a 
pair  of  domain  components  of  the  domain/range  facet  of  an  operation  if  those  two  rc,main 
components  are  equal,  or  if  one  of  them  is  a  specialization  of  the  other,  or  even  if  they 
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merely  intersect.  In  the  case  of  one  related  to  the  other  by  specialization,  the  more 
specialized  concept  will  replace  both  of  them,  In  case  of  merely  intersecting,  an  extra  test  will 
have  to  be  inserted  into  the  definition  of  the  new  coalesced  operation. 

Given  this  donjain/range  entry  for  Set-insert:  <  Anything  Sets  -♦  Sets  >,  we  see  that  it  is  ripe 
for  coalescing.  Since  Sets  is  a  specialization  of  Anything,  the  new  operation  F(x),  which  is 
defined  as  Set-insert(x,x),  will  have  a  domain/range  entry  of  the  form  <  Sets  -»  Sets  >.  That 
is,  the  specialized  concept  Sets  will  replace  both  of  the  old  domain  elements  (Anything  and 
Sets).  F(x)  takes  a  set  x  and  inserts  it  into  its.'-.T.  Thus  F({a,b})«{a,b,{a,b}}.  In  fact,  this  new 
operation  F  is  very  exciting  because  it  always  ;icems  to  give  a  new,  larger  set  than  the  one 
you  feed  in  as  the  argument. 

We  have  seen  how  the  Domain/range  f.icets  can  prune  away  meaningless  coalescings,  as  well 
as  meaningless  compositions.  Any  proposed  composition  or  coalescing  will  at  least  be 
syntactically  meaningful.  If  all  compositions  are  proposed  only  for  at  least  one  good  semantic 
reason,  then  those  passing  the  domain/range  test,  and  hence  those  which  ultimately  get 
created,  will  all  be  valuable  new  concepts.  Since  almost  all  coalescings  are  semantically 
interesting,  any  of  them  which  have  a  valid  Domain/Range  entry  will  get  created  and 
probably  will  be  inte.'estii.g. 

This  facet  is  occasionally  used  to  suggest  conjectures  to  investigate.  For  example,  a  heuristic 
rule  says  that  if  the  domain/range  entries  have  the  form  <D  D  D...  -♦  genl(D)  >,  then  it’s 
worthwhile  seeing  whether  the  value  of  this  operation  doesn’t  really  always  lie  inside  D 
itself.  This  is  used  right  after  the  BagswNumbers  analogy  is  found,  in  the  following  way. 
One  of  the  Bag-operations  known  already  is  Bag-union.  The  analogy  causes  AM  to 
consider  a  new  operation,  with  the  same  algorithm  as  Bag-union,  but  restricted  to  Bags-of- 
T’s  (numbers  in  unary  representation).  The  Domain/range  facet  of  this  new,  restricted 
mutation  of  Bag-union  contains  only  this  entry:  <Bags-of-T’s  Bags-of-T’s  -»  Bags>.  Since 
Bags  Is  a  generalization  of  Bags-of-T’s,  the  heuristic  mentioned  above  triggers,  and  AM  sees 
whether  or  not  the  union  of  two  Bags-of-T’s  is  always  a  bag  containing  only  T’s.  It  appears 
to  be  so,  even  in  extreme  cases,  so  the  old  Domain/range  entry  is  replaced  by  this  new  one: 
<Bags-of-T’s  Bags-of-T’s  -♦  Bags-of-T’s>.  When  the  user  asks  AM  to  call  these  bags-of-T’s 
"numbers",  this  entry  becomes  <Numbers  Numbers  -♦  Numbers>.  In  modern  terms,  then,  the 
conjecture  suggested  was  that  the  sum  of  two  numbers  Is  always  a  number. 

To  sum  up  this  last  ability  in  fancy  language,  we  might  say  that  one  mechanism  for 
proposing  conjectures  is  the  prejudicial  belief  in  the  unlikelihood  of  asymmetry.  In  this 
case,  it  is  asymmetry  in  the  parts  of  a  Domain/range  entry  that  draws  atter'ion.  Such 
conjecturing  can  be  done  by  any  action  part  of  any  heuristic  rule;  the  Conjef  fu^et  entries 
don’t  have  a  monopoly  on  initiating  this  type  of  activity. 


5.2.11.  Worth 

How  can  we  represent  the  worth  of  each  concept?  Here  are  some  po.ssible  suggestions: 

1.  The  most  intelligent  (but  most  difficult)  solution  is  "purely  symbolically".  That  is,  an 

individualized  description  of  the  good  and  bad  points  of  the  concept;  when  it  is 
useful,  when  misleading,  etc. 

2.  A  simpler  solution  would  be  to  "standardize"  the  above  symbolic  description  once 
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and  for  all,  fixing  a  universal  list  of  questions.  So  each  concept  would  have  to 
answer  the  questions  on  this  list  (How  good  are  you  at  motivating  new  concepts?, 
How  costly  is  your  definition  to  execuw? ...).  The  answers  might  each  be  symbolic: 
e.g.,  arbitrary  English  phrases. 

3.  To  simplify  this  scheme  even  more,  we  can  assume  that  the  answers  to  each  question 

will  be  numeric-valued  functions  (i.e,  LISP  code  which  can  be  evaluated  to  yield  a 
number  between  0  and  1000).  The  vector  of  numbers  produced  by  Evaluating  all 
these  functions  will  then  be  easy  to  manipulate  (e.g.  using  dot-product,  vector- 
product,  vector-addition,  etc.),  and  the  functions  themselves  may  be  inspected  for 
semantic  content.  Nevertheless,  much  content  is  lost  in  passing  from  symbolic 
phrases  to  small  LISP  functions. 

4.  A  slight  simplification  of  the  above  would  be  to  just  store  the  vector  of  numbers 

answering  the  fixed  set  of  questions;  i.e.,  don’t  bother  storing  a  bunch  of  programs 
which  compute  them  dynamically. 

5.  Even  simpler  would  be  to  try  to  assign  a  single  "worthwhileness"  number  to  each 
concept,  in  lieu  of  the  vector  of  numbers.  Simple  arithmetic  operations  could 
manipulate  Worth  values  then.  In  some  cases,  this  linear  ordering  seems 
reasonable  ("primes"  really  are  better  than  "palindromes".)  Yet  in  many  cases  we 
find  concepts  wliich  are  too  different  to  be  so  easily  compared  (eg.,  "numbers"  and 
"angles".) 

6.  The  least  intelligent  solution  is  none  at  all;  each  concept  is  considered  equally 
worthwhile  as  any  other  concept.  This  threatens  to  be  combinatorial  dynamite. 

As  we  progress  along  the  intelligent-»-»-»trivial  dimension,  we  find  that  the  schemes  get  easier 
and  easier  to  code,  the  Worth  values  get  easier  and  easier  to  deal  with,  but  the  amount  of 
reliable  knowledge  packed  into  them  decreases. 

Initially,  scheme  *3  above  was  chosen  for  AM:  a  vector  of  numeric-valued  procedural 
answers  to  a  fixed  set  of  questions.  Here  are  those  questions,  the  components  of  the  Worth 
vectors  for  each  concept: 

1.  Overall  aesthetic  worth. 

2.  Overall  utility.  Combination  of  usefulness,  ubiquity. 

3.  Age.  How  many  cycles  since  this  concept  was  created? 

4.  Life-span.  Can  this  concept  be  forgotten  yet? 

5.  Cost.  How  much  epu  time  has  been  spent  on  this  concept,  since  its  creation? 

Notice  that  in  general  no  constant  number  can  answer  one  of  these  questions  once  and  for 
all  (consider,  e.g.,  Life-span).  Each  ‘answer’  had  to  be  a  numeric-valued  LISP  function. 

A  few  questions  which  crop  up  often  are  not  present  on  thi?  list,  since  they  can  be  answered 
trivially  using  standard  LISP  functions  (e.g.,  "How  much  space  does  concept  C  use  up?"  can 
be  found  by  calling  the  function  "COUNT"  on  the  property-list  of  the  LISP  atom  "C"). 

Another  kind  of  .,uestion,  which  was  anticipated  and  did  in  fact  come  up  frequently,  is  of 
the  form  "How  good  are  the  entries  on  facet  F  of  this  concept?",  for  various  values  of  F. 
Since  there  are  a  couple  dozen  kinds  of  facets,  this  would  mean  adding  a  couple  dozen  more 
q'.sestions  to  the  list.  The  line  must  be  drawn  somewhere.  If  too  much  of  AM’s  time  is 
drained  by  evaluating  where  it  is  already,  it  can  never  progress. 
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The  heuristic  rules  are  responsible  for  initially  setting  up  the  various  entries  on  the  Worth 
facets  of  new  concepts,  and  for  periodically  altering  those  entries  for  all  concepts,  and  for 
delving  into  those  entries  when  required. 

Recent  experiments  have  shown  (see  Experiment  1,  page  127)  there  was  little  change  in 
behavior  when  each  vector  of  functions  was  replaced  by  a  single  numeric  function  (actually, 
the  sum  of  the  values  of  the  components  of  the  "old"  vector  of  functions).  There  wasn’t  even 
too  much  change  when  this  was  replaced  by  a  single  number.  There  :uas  a  noticeable 
degradation  (but  no  collapse)  when  all  the  concepts’  numbers  were  set  equal  to  each  other 
initially. 

For  the  purposes  of  this  document,  then  (except  for  this  page  and  the  discussion  of 
Experiment  1),  we  may  as  well  assume  that  each  concept  has  a  single  number  (between  0 
and  1000)  attached  as  its  overall  "Worth"  rating.  This  number  is  set^°  and  referenced  and 
updated  by  heuristic  rules.  Experiment  1  can  be  considered  as  showing  that  a  more 
sophisticated  Worth  scheme  is  not  necessary  for  the  particular  kinds  of  behaviors  that  AM 
exhibits. 


5.2.12.  Interest 

Now  that  we  know  how  how  to  judge  the  overall  worth  of  the  concept  "Composition",  let’s 
turn  to  the  question  of  how  interesting  some  specific  composition  is.  Unfortunately,  the 
Worth  facet  really  has  nothing  to  say  about  that  problem.  The  Worth  of  the  concept 
"Compose"  has  little  effect  on  how  interesting  a  particular  composition  is:  "CountoDivisors- 
of"  is  very  interesting,  and  "InsertoMember"'”  is  less  so.  The  Worth  facets  of  those  concepts 
will  say  something  about  their  overall  value.  And  yet  there  is  some  knowledge,  some 
"features"  which  would  make  any  composition  which  possessed  them  more  interesting  than  a 
composition  which  lacked  them: 

Are  the  domain  and  range  of  the  composition  equal  to  each  other? 

Are  interesting  properties  of  each  component  of  the  composition  preserved? 

Are  undesirable  properties  lost  (i.e.,  not  true  about  the  composition)? 

Is  the  new  composition  equivalent  to  some  already-known  operation? 

These  hints  about  "features  to  look  <'or"  belong  tacked  onto  the  Composition  concept,  since 
they  modify  all  compositions.  Where  and  how  can  this  be  done? 

For  this  purpose  each  concept  -  including  "Composition"  -  can  have  entries  on  its 
"Interest”  facet.  It  contains  a  bunch  of  features  which  (if  true)  would  make  any  particular 
example  of  the  current  concept  interesting. 

The  format  foi  the  Interest  facet  is  as  follows: 

<  Conflicl-mstrix 

<F«8tur«|,  Value),  Reason),  Usod)> 

CFeature^i  Value2,  Reasun2,  Used2> 
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<F«aiur«K,  V«Iu«k,  RcnsotiK,  Ut«dK> 

> 

This  is  the  format  of  the  facet  itself,  not  of  each  entry.  The  conflict-matrix  is  special  and 
will  be  discussed  below.  Each  Feature/ Value/Reason/Used  quadruple  will  be  termed  an 
"entry"  on  the  Interest  fa^et. 

Each  "Feature,"  is  a  LISP  predicate,  indicating  whether  or  not  some  interesting  property  is 
satisfied.  The  corresponding  "Value,"  is  a  numeric  function  for  computing  just  how 
valuable  this  feature  is.  The  "Reason,"  is  a  token  (usually  an  English  phrase)  which  is 
tacked  along  and  moved  around,  and  can  be  inspected  by  the  user.  The  "Usedj"  subpart  is 

a  list  of  all  the  concepts  whose  definitions  are  known  to  incorporate^^  this  feature;  all 
examples  of  such  concepts  will  then  automatically  satisfy  this  Featurej. 

For  example,  here  is  one  entry  from  the  Interest  facet  of  Compose: 

FEATURE:  Domain(Argl  )=Rang8(Arg2) 

VALUE:  .4  ♦  .4xWorth(Domain(Argl))  ♦  .2xPriority(eurr«nt  task) 

REASON:  "The  composition  of  Argl  and  Arg2  will  map  from  a  set  back  into  that  same 
set" 

USED:  Compose-with-self'DomainsRange'Operation,  lnteresting*compose'4 

Just  as  with  Isa’s  and  Generalizations,  we  can  make  a  general  statement  about  Interest 
features: 

Any  feature  tacked  onto  the  interest  facet  of  any  member  of  ISA’sfC),  also  applies  to  C. 

That  is,  X.Intei'est  is  relevant  to  C  Iff  C  is  an  example  of  X.  For  example,  any  feature 
which  makes  an  operation  interesting,  also  makes  a  composition  interesting. 

So  we’d  like  to  define  the  function  Interests(C)  as  the  union  of  the  Interest  features  found 
tacked  onto  any  member  of  ISA’s(C).^^  But  some  of  these  might  have  already  been 
conjoined  to  a  definition,  to  form  the  concept  C  (or  a  generalization  of  C).  So  all  C’s  will 
trivially  (by  definition)  satisfy  such  features.  The  USED  subparts  can  be  employed  to  find 
such  features.  In  fact,  the  final  value  of  Interests(C)  is  the  one  computed  above,  using 
ISA’s(C),  but  after  eliminating  all  the  features  whose  USED  subparts  pointed  to  any 
member  of  ISA’s(C). 

This  covers  the  purpose  of  each  subpart  of  each  entry  on  a  typical  Interest  facet.  Now  we’re 
ready  to  motivate  the  presence  of  the  Conflict-matrices. 

Often,  AM  will  specialize  a  concept  by  conjoining  onto  its  definition  some  features  which 
would  make  any  example  of  the  concept  interesting.  So  any  example  of  this  new  specialized 
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concept  is  thus  guaranteed  tu  be  an  interesting  example  of  the  old  concept.  Sometimes, 
however,  a  pair  of  features  are  exclusive;  both  of  them  can  never  be  satisfied 
simultaneously.  For  example,  a  composition  can  also  be  interesting  if  "arg2"  is  an  operation 
from  Range(argl)  into  a  set  which  is  much  more  intere.sting  than  either  Domain(argl>  or 
Range(argl).  Clearly,  this  feature  and  the  one  shown  above  can’t  both  be  true  ("x=y"  and 
"x  much  more  interesting  than  y"  can’t  occur  simultaneously).  If  AM  didn’t  have  some 
systematic  way  to  realize  this,  however,  it  might  create  a  new  concept,  called  Interesting- 
composition,  defined  as  any  composition  satisfying  both  of  those  features.  But  then  this 
concept  will  be  vacuous:  no  operation  can  possibly  satisfy  that  over-constrained  definition; 
this  new  concept  will  have  no  examples;  it  is  the  null  concept;  it  is  trivially  forgettable. 
Merely  to  think  of  it  is  a  blot  on  AM’s  claim  to  rationality. 

The  "Confiict-matrix"  is  specified  to  prevent  many  such  trivial  combinations  from  eating  up 
a  lot  of  AM’s  time  (and,  as  usual,  it  helps  to  make  AM  appear  smarter).  If  there  are  K 
features  present  for  the  Interest  facet  of  the  concept,  then  its  conflict-matrix  will  be  a  KxK 
matrix.  In  row  i,  column  J  of  this  matrix  is  a  letter,  indicating  the  relationship  between 
features  I  and  j: 

E  Exclusive  of  each  other:  they  both  can’t  be  true  at  the  same  time. 

-♦  Implies:  If  feature  i  holds,  then  feature  j  must  hold. 

*-  Implied  by:  If  feature  j  holds,  then  so  does  feature  i. 

«  Equal.  Feature  i  holds  precisely  when  feature  j  holds. 

U  Unrelated.  As  far  as  known,  there  is  no  connection  between  them. 

These  little  relations  are  utilized  by  some  of  the  heuristic  rules.  Here  is  one  such  rule.  Its 
purpose  is  to  create  a  new,  specialized  form  of  concept  C,  if  many  examples  of  C  were 
previously  found  very  quickly. 

IF  Curr«nt«tasK  is  (Fillin  Specializations  of  C) 
and  ||C.Exampies||>30 

and  Time-spent-on-C-so-far  <  3  cpu  seconds, 
and  Interests(C)  is  not  null, 

THEN  create  a  new  concept  named  interesting-C, 

Defined  as  the  conjunction  of  C.Defn  and  the  highest-valued  member  of  Interests(C) 
which  is  U  (unrelated)  to  any  feature  USED  in  the  definition  of  C. 
and  add  the  following  task  to  the  agenda:  Fillin  examples  of  Interesting-C,  with  value 
computed  as  the  Value  subpari  of  the  chosen  feature,  for  this  reason:  "Any 
example  of  Interesting-C  is  automatically  an  interesting  example  of  C". 
and  add  "Interesting-C"  to  the  USED  subpart  of  the  entry  where  that  'aature  waii 
originally  plucked  from. 

i  If  course,  the  LISP  form  of  the  above  rule  is  really  more  detailed  about  what  to  do,  but 
ti  general  flavor  of  the  interaction  with  the  Interest  facet  should  come  across.  As  before, 
thi.  value  desired  is  not  C.Interest,  but  rather  the  post-rippling  value  Interests(C).  C.Int 
coii  “ins  a  few  features  pertaining  just  to  C’s,  but  Interests(C)  contains  many  additional 
feati:  :;s  which  are  not  limited  in  scope  to  merely  judging  C’s.  but  pertain  to  a  more  general 
class  concepts.  The  quantity  Time-spent-on-C-so-far’  is  one  component  of  the  Worth 
facet  of  C;  It  might  just  as  well  have  been  accessed  from  some  "Past-history"  record  of  AM’s 
activities.  The  numbers  in  the  rule  -  and  every  lihle  bit  of  that  rule  -  were  specified  ad 
hoc  by  '  'S  author.  This  is  true  for  each  rule  initially  present  in  AM.  As  Section  6.2  will 
discuss,  th*?  precise  numbers  don’t  drastically  affect  the  system’s  performance. 
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5.2.13.  Suggest 

This  section  describes  a  space-saving  "trick",  and  a  "fix-up”  to  undo  some  potentially  serious 
side-effects  of  that  trick.  Readers  not  interested  in  this  level  of  detail  may  skip  to  the  next 
subsection. 

AM  maintains  a  long  list  of  tasks  (the  agenda),  ordered  by  a  global  priority  rating  scheme. 
Besides  this,  AM  maintains  two  threshholds:  Do-threshhold  and  a  lower  one,  Be-threshhold. 

When  a  new  task  is  proposed,  if  its  global  priority  is  below  Be-threshhold,  then  it  won’t 
even  be  entered  on  the  agenda.  This  value  is  set  so  low  that  any  task  having  even  one 
mediocre  reason  will  make  it  onto  the  agenda. 

After  a  task  is  finished  executing,  the  top-rated  one  from  the  agenda  is  selected  to  work  on 
next  If  its  priority  rating  is  below  Do-threshhold,  however,  it  is  put  back  on  the  agenda, 
and  AM  complains  that  no  task  on  the  agenda  is  very  interesting  at  the  moment.  AM  then 
spends  a  minute  or  so  looking  around  for  new  tasks,  re-evaluating  the  priorities  of  the  tasks 
on  the  agenda  already,  etc. 

One  way  to  find  new  tasks  (and  new  reasons  for  already-existing  tasks)  is  to  evaluate  the 
"Suggest"  facets  of  all  the  concepts  in  the  system.  More  precisely,  each  Suggest  facet 
contains  some  heuristics,  encoded  into  LISP  functions.  Each  function  accepts  a  number  N 
as  an  argument  (representing  some  minimum  value  tolerable  for  a  new  task),  and  the 
function  returns  as  its  value  a  list  of  new  tasks.  These  are  then  merged  into  the  agenda,  if 
desired. 

Semantically,  each  function  is  one  heuristic  rule  for  suggesting  a  new  task  which  might  be 
very  plausible,  promising,  and  a  propos  at  the  current  time.  For  example,  here  is  one  entry 
from  the  Suggest  facet  of  Any-concept: 

IF  there  are  no  examples  for  concept  C  filled  in  so  far, 

THEN  consider  the  task  "Fiilin  examples  of  C",  for  the  following  reason:  "No  examples 
of  C  filled  in  so  far",  whose  value  is  half  of  Worth(C).  If  that  value  is  below 
argl,  then  forget  it;  otherwise,  try  to  add  to  to  the  agenda. 

The  argument  "argl"  is  that  low  numeric  value,  N,  supplied  to  the  Suggest  facet. 

This  entry  alone  will  produce  a  multitude  of  potential  tasks;  for  concepts  whose  Worth 
numbers  are  high,  or  for  which  a  task  is  already  on  the  agenda  to  fill  in  their  examples, 
these  suggested  tasks  will  be  remembered;  most  of  the  other  ones  will  typically  be  forgotten. 

One  use  of  this  facet  is  thus  to  "beef  up"  the  agenda  whenever  AM  is  discontented  with  all 
the  tasks  thereon.  At  such  a  time,  AM  may  call  on  all  the  Suggest  facets  in  the  system,  and  a 
large  volume  of  new  tasks  will  be  added  to  the  agenda.  Many  of  them  will  exist  there 
already,  but  for  different  reasons,  so  many  old  tasks’  priority  values  will  rise.  After  this 
period  of  suggesting  is  over,  the  agenda’s  highest-ranking  task  will  hopefully  have  a  higher 
value  than  any  did  before.  Also  at  this  time,  the  Be-thrcshhold  and  Do-threshhold 
numbers  are  reduced.  So  there  are  two  reasons  why  the  top  task  may  now  be  rated  higher 
than  Do-threshhold.  If  it  isn’t,  then  the  threshholds  are  lowered  again,  and  again  ail  the 
Sugg  facets  are  triggered  (this  time  with  a  lower  N  value). 
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Both  threshholds  are  raised  slightly  every  time  AM  succeeds  in  picking  and  executing  a 
task.  So  they  follow  a  pattern  of  slow  increase,  followed  by  a  sudden  decrement,  followed  by 
another  slow  increase,  etc.  This  was  intended  to  mimic  a  human’s  increasing  expectations  as 
he  makes  progress.^'*  It  also  mimics  the  way  a  human  strains  his  mind  when  an  obstacle  to 
that  progress  appears;  if  the  straining  doesn’t  produce  a  brilliant  new  insight,  he  grudgingly 
is  willing  to  reduce  his  expectations,  and  perhaps  resume  some  "old  path"  abandoned 
earlier. 

Another  use  of  this  facet  is  to  re-suggest  tasks  that  might  have  been  dropped  from  (or  never 
made  it  onto)  the  agenda,  because  they  weren’t  valued  above  Be-threshhold.  How  might  this 
work?  Suppose  that,  at  an  earlier  time,  a  task  was  proposed  but  never  made  it  onto  the 
agenda  because  Be-threshhold  was  quite  high.  Now,  suppose  Be-threshhold  is  much  lower 
(due  to  a  succession  of  failures).  If  a  Sugg  facet  re-proposes  that  same  task,  it  will  be 
accepted,  will  "stick"  onto  the  agenda  (albeit  near  the  bottom).  The  Suggest  facets  can 
reproduce  most  of  the  common  tasks,  and  try  to  stick  them  on  the  agenda  (though  usually 
for  a  mediocre  to  poor  reason).  It  will  still  usually  require  another  reason  for  such  a  task  to 
rise  to  the  very  top  of  the  agenda,  and  be  selected  and  executed. 

So  the  use  of  the  two  threshholds  is  really  an  inaesthetic  space-saving  device,  ar.d  the  role 
of  the  Suggest  facets  is  merely  to  correct  the  errors  introduced  in  this  way.  There  may  be 
no  convincing  intuitive  reason  for  having  these  facets  at  all  in  a  "just"  world. 


5.2.14.  Fillin/Check 


To  doubt  everything  doesn't  suffice;  one  must  know  why  he  doubts. 

"*•  Poincar#' 


There  is  one  more  level  of  structure  to  AM’s  representation  of  a  concept  than  the  simple 
"properties  on  a  property-list"  image.  Each  concept  consists  of  a  bunch  of  facets;  each  facet 
follows  the  format  layed  down  for  it  (and  described  in  the  preceding  several  subsections). 
Yet  each  facet  of  each  concept  can  have  two  additional  "subfacets’’  (little  slots  that  are  hung 
onto  any  desired  slot)  named  Fillin  and  Check. 

The  "Fillin’’  field  of  facet  F  of  concept  C  is  abbreviated  C.F.Fillin.  The  format  of  that 
subfield  is  a  list  of  heuristic  rules,  encoded  into  LISP  functions.  Semantically,  each  rule  in 
C.F.Fillin  should  be  relevant  to  filling  in  entries  for  facet  F  of  any  concept  which  is  a  C. 
This  substructure  is  an  implementation  answer  to  the  question  of  where  to  place  certain 
heuristic  rules. 

As  an  illustration,  let  me  describe  a  typical  rule  which  is  found  on  Compose.Examples.Fillin. 
According  to  the  last  paragraph,  this  must  be  useful  for  filling  in  examples  of  any  operation 
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which  Is  a  composition.  The  rule  says  that  if  the  composition  AoB  is  formed  from  two  very 
time-consuming  operations  A  and  B,  then  it’s  worth  trying  to  find  some  examples  of  AoB  by 
symbolic  means;  in  this  case,  scan  the  examples  of  A  and  of  B,  for  some  pair  of  examples 
x-*y  (example  of  B)  and  y-»z  (example  of  A).  Then  posit  that  x-+z  is  an  example  of  AoB. 
This  rule  applies  precisely  to  the  task  of  filling  in  examples  of  Examples(Composition). 
Thus,  it  is  relevant  to  the  task  "Fill  in  examples  of  Insertolnsert".  It  is  irrelevant  if  you 
change  the  action  (e.g.,  "Check  examples  of  Insertolnsert"),  or  if  you  change  the  facet  to  be 
dealt  with  (e.g.,  "Fill  in  algorithms  for  Insertolnsert"),  or  if  you  change  the  class  of  concept 
(e.g.,  "Fill  in  examples  of  Set-union)^^. 

As  another  illustration,  let  me  describe  a  typical  rule  which  is  found  on 
Compose.Conjec.Fillin.  It  says  that  one  potential  conjecture  about  a  given  composition  AoB 
is  that  it  is  unchanged  from  A  (or  from  B).  This  happens  often  enough  that  it’s  worth 
examining  each  time  a  new  composition  is  made.  This  rule  applies  precisely  to  the  task  of 
filling  in  conjectures  about  particular  compositions. 

The  subfacct  Any-Concept.Examples.Fillin  is  quite  large;  it  contains  all  the  known  methods 
for  filling  in  examples  of  C  (when  all  we  know  is  that  C  is  a  concept).  Here  are  a  few  of 
those  techniques^*: 

1.  Instantiate  C.Defn 

2.  Search  the  examples  facets  of  all  the  concepts  on  Generalizations(C)  for  examples  of 

C 

3.  Run  some  of  the  concepts  named  in  In-ran-of(C)  [i.e.,  operations  whose  range  is  C] 

and  collect  the  resultant  values. 

Any-Concept.Examples.Chcck  is  large  for  similr  reasons.  A  typical  entry  there  says  to 
examine  each  verified  example  of  C:  if  it  is  also  an  example  of  a  specialization  of  C,  then  it 
must  be  removed  from  C.Examplcs  and  inserted^^  into  the  Examples  facet  of  that 
specialized  concept. 

Here  is  one  typical  entry  from  Operation.Domain/Range.Check: 

IF  a  domain/ranga  antry  has  tha  form  (D  0  0...  R), 

and  all  tha  D’s  ara  equal,  and  R  is  a  ganaralization  of  D, 

THEN  it's  worth  seeing  whether  (D  D  0...  -a  D)  is  consistent  with  all  known  examples  of  tha 
operation. 

If  there  ara  no  known  examples,  add  a  task  to  the  agenda  requesting  they  be  filled  in. 

If  there  are  examples,  and  (D  D  D...  D)  is  consistent,  add  it  to  the  Domain/range  facet 
of  this  operation. 

If  there  are  some  contradicting  examples,  create  a  new  concept  which  is  defined  as  this 
operation  restricted  to  (D  D  D...  -*  0). 
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Note  that  this  "Checking"  rule  doesn’t  just  passively  check  the  designated  facet;  it  actively 
"fixes  up"  faulty  entries,  adds  new  tasks,  creates  new  concepts,  etc.  All  the  check  rules  are 
very  aggressive  in  this  way.  For  example,  one  entry  on  No-multiple-elements- 
structure.Examples.Check  will  actually  remove  any  multiple  occurrences  of  an  element  from 
a  structure. 

As  you  might  expect,  the  set  Checks(C.F)  of  all  relevant  rules  for  checking  facet  F  of 
concept  C  is  obtained  as  (lSA’s(C)).F.Check.  That  is,  look  for  the  Check  subfacet  of  the  F 
facet  of  all  the  concepts  on  ISA’s(C)).  Similarly,  Fillins(C.F)  is  the  union  of  the  Fillin 
subfacets  of  the  F  facets  of  all  the  concepts  on  ISA’s(C). 

When  AM  chooses  a  task  like  "Fillin  examples  of  Primes",  its  first  action  is  to  compute 
Fillins(Primcs.Exs).  It  does  this  by  asking  for  ISA’s(Primes);  that  is,  a  list  of  all  concepts  of 
which  Primes  is  an  example.  This  list  is:  <Objects  Any-concept  Anything>.  So  the  relevant 
heuristics  are  gathered  from  Objects.Exs.Fillin,  etc.  This  list  of  heuristics  is  then  executed, 
in  order  (last  executed  are  the  heuristics  attached  to  Anything.Exs.Fillin). 

It  should  now  be  clear  what  is  meant  when  a  concept’s  facets  are  listed  in  the  following 
format: 


Nam«(t)  . 

• 

Frob,  Frobnalion 

f 

Algorithms 

A1  A2 

Examples 

Ei  E2  E3  E4  E5  E6 

Fillin 

Rulel  Rule2 

Check 

Rules  Rule4  RuleS 

Oomain/range 

DRl  DR2  DR3 

Cheek 

Rules 

Conjees 

Cl  C2  C3  C4  C5  C6 

Fillin 

Rule?  Rules 

Check 

• 

P 

Rule9  Rule  10 

E.g.,  the  entry  Rule9  is  a  heuristic  rule  which  may  help  to  check  entries  on  the  Conjees  facet 
of  any  Frob^®.  This  notation  will  not  be  used  actually  in  this  document,  partly  for  the 
benefit  of  those  readers  who  skip  this  subsection,  partly  for  consistency  between  concepts 
diagrammed  before  and  after  this  subsection.  Rather,  all  the  Fillin  heuristics  for  a  concept 
will  be  gathered  together  into  what  appears  to  be  just  one  coherent  facet.  Theoretically,  of 
course,  one  could  organize  them  that  way,  with  an  extra  precondition  on  each  Fillin 
heuristic  to  indicate  which  facet  it  is  useful  for  filling  in. 
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5.2.15.  Other  Facets  which  were  Considered 

Most  facets  (like  "Definitions")  were  anticipated  from  the  very  beginning  planning  of  AM, 
and  proved  just  as  useful  as  expected.  Others  (like  "Intuitions")  were  also  expected  to  be 
very  important,  yet  were  a  serious  disappointment.  Still  others  (like  "Suggest")  were 
unplanned  and  grumblingly  acknowledged  as  necessary  for  the  particular  LISP  program 
that  bears  the  name  AM.  Finally,  we  turn  to  a  few  facets  which  were  initially  planned,  and 
yet  which  were  adjudged  useless  around  the  time  that  AM  was  coded.  They  were  therefore 
never  really  a  part  of  the  LISP  program  AM,  although  they  figured  in  its  proposal.  Let  me 
list  them,  and  explain  why  each  one  was  dropped. 

1.  UN-INTERESTINGNESS.  This  was  to  be  similar  to  the  Interest  part.  It  would  contain 

entries  of  the  form  feature/value/reason,  where  the  feature  would  be  a  bad  (dull, 
trivializing,  undesirable,  uninteresting)  property  that  an  entity  (a  concept  or  a  task) 
might  possess.  If  it  did,  then  the  value  component  would  return  a  negative  number  as  its 
contribution  to  the  worth/priority  of  that  entity.  This  sounded  plausible,  but  turned  out 
to  be  useless  in  practice:  (i)  There  were  very  few  features  one  could  point  to  which 
explicitly  indicated  when  something  was  boring;  (ii)  Often,  a  conjunction  of  many  such 
features  would  make  the  entity  seem  unusual,  hence  interesting;  (iii)  Most  entities  were 
viewed  as  very  mediocre  unless/until  specific  reasons  to  the  contrary,  and  in  those  cases 
the  presence  a  few  boring  properties  would  be  outshadowed  by  the  few  non-boring  ones. 
In  a  sea  of  mediocrity,  there  is  little  need  to  separate  the  boring  from  the  very  boring. 

2.  JUSTIFICATION.  For  conjectures  which  were  not  yet  believed  with  certainty,  this  part 

would  contain  all  the  known  evidence  supporting  hem.  This  would  hopefully  be 
convincing,  if  the  user  (or  a  concept)  ever  wanted  to  know.  In  cases  of  contradictions 
arising  somehow,  this  facet  was  to  keep  hold  of  the  threads  that  could  be  untangled  to 
resolve  those  paradoxes.  As  described  earlier,  this  duty  could  naturally  be  assumed  by 
the  Conjees  facet  of  each  concept.  The  other  intended  role  for  this  facet  was  to  hold 
sketches  of  the  proofs  of  theorems.  Unfortunately,  the  intended  concepts  for  Proof  and 
Absolute  truth  were  never  implemented,  and  thus  most  of  the  heuristic  rules  which 
would  have  interacted  with  this  facet  are  absent  from  AM.  It  simply  was  never  needed. 

3.  RECOGNITION  Originally,  it  was  assumed  that  the  location  of  relevant  concepts  and 

their  heuristics  would  be  much  more  like  a  free-for-all  (pandemonium)  than  an  orderly 
rippling  process.  As  with  the  original  use  of  BEINGs^®,  the  expectation  was  that  each 
concept  would  have  to  "shout  out"  its  relevance  whenever  the  activities  triggered  some 
recognition  predicate  inside  that  concept.  Such  predicates  were  to  be  stored  in  this  facet. 
But  it  quickly  became  apparent  that  the  triggering  predicates  which  were  the  left-hand- 
sides  of  the  heuristic  rules  were  quick  enough  to  obviate  the  need  for  pre-processing 
them  too  heavily.  Also,  the  only  rules  relevant  to  a  given  activity  ori  concept  C  always 
seemed  to  be  attainable  by  rippling  in  a  certain  direction  away  from  C.  This  varied 
with  the  activity,  and  a  relatively  small  table  could  be  written,  to  specify  which  direction 
to  ripple  in  (for  any  given  desired  activity).  We  see  that  for  "Fill-in  examples  of...",  the 
direction  to  ripple  in  is  "Generalizations",  to  locate  relevant  heuristic  rules.  For  "Judge 
interest  of..."  the  direction  is  also  generalizations.  For  "Access  specializations  of",  the 
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direction  is  Specializations,  etc.  The  only  important  point  here  is  that  the  Recognition 
facet  was  no  longer  needed. 


5.3.  AM’s  Starting  Concepts 

The  first  subsection  presents  a  diagram  of  the  top-ievel  (general)  concepts  AM  started  with, 
with  the  lines  indicating  the  Generalizations/Specializations  kinds  of  relationships  (single 
line  links)  and  a  few  Examples/Isa’s  links  (triple  vertical  lines).  Several  specific  concepts 
have  been  emitted  from  that  picture.  All  the  concepts  initially  fed  to  AM  are  then  listed 
alphabetically  and  described  in  Section  5.3.2.  A  full  facet-by-facet  description  of  each 
concept  is  provided  in  Appendix  2.  Finally,  Section  5.3.3  discusses  the  choice  of  starting 
concepts. 


5.3.1.  Diagram  of  Initial  Concepts 
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Any thing 


Any-concept  non-concepts 


The  diagram  above  represents  the  "topmost"  concepts  which  AM  had  initially,  shown 
connected  via  Specialization  links  (\)  and  Examples  links  (||p.  The  only  concepts  not 
diagrammed  are  examples  of  the  concept  Operation.  There  are  47  such  operations. 

Also,  we  should  note  that  many  entitles  exist  in  the  .system  which  are  not  themselves 
concepts.  For  example,  the  number  "3",  though  it  be  an  example  of  many  concepts,  is  not 
itself  a  concept.  All  entities  which  are  concepts  are  present  on  the  list  called  CONCEPTS, 
and  they  all  have  property  lists  (with  facet  names  as  the  properties).  In  hindsight,  this 
somewhat  arbitrary  scheme  is  regrettable.  A  more  aesthetic  designer  might  have  come  up 
with  a  more  uniform  system  of  representation  than  AM’s. 
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5.3.2.  Summary  of  Initial  Concepts 

Since  the  precise  set  of  concepts  is  not  central  to  the  design  of  AM,  or  the  quality  of 
behaviors  of  AM,  they  are  not  worth  detailing  here.  On  the  other  hand,  a  cursory 
familiarity  with  their  names  and  definitions  should  aid  the  reader  in  building  up  an 
understanding  of  what  AM  has  done.  For  that  reason,  the  concepts  will  now  be  briefly 
described,  in  alphabetical  order.  This  is  the  same  order  as  concepts  are  listed  on  page  173. 
A  fuller  description  of  the  concepts  is  provided  in  Appendix  2.  The  ordering  within  that 
appendix  is  different;  concepts  are  grouped  together  if  they  are  semantically  related,  by 
starting  at  the  top  of  the  diagram  and  meandering  downward. 

ACTIVITY  represents  something  that  can  be  "performed".  All  Actives  -  and  only  Actives  - 
have  Domain/range  facets  and  Algorithms  facets. 

ALL-BUf-FIRST-ELEMENT  is  an  operation  which  takes  an  ordered  structure  and  removes  the 
first  element  from  it.  It  is  similar  in  spirit  to  the  Lisp  function  "CDR". 

ALL-BUT-LAST-ELEMENT  takes  an  ordered  structure  and  removes  its  last  element. 

ANY-CONCEPT  is  useful  because  it  holds  all  the  very  general  tactics  for  filling  in  and 
checking  each  facet.  The  definition  of  Any-concept  is  "X  (x)  x<CONCEPTS".  'CONCEPTS’  Is 
AM’s  global  list  of  entities  known  to  be  concepts.  Initially,  this  list  contains  the  hundred  or 
so  concepts  which  AM  starts  with  (e.g.,  all  those  diagrammed  on  the  preceding  page). 

ANYTHING  is  defined  as  "X  (x)  T";  i.e.,  a  predicate  which  will  always  return  true.  Notice  that 
the  singleton  {a}  is  an  example  of  Anything,  but  (since  it’s  not  on  the  list  CONCEPTS)  it  is 
not  an  example  of  Any-concept. 

ATOM  contains  data  about  all  primitive,  indivisible  objects  (identifiers,  constants,  variables). 

BAG  is  a  type  of  structure.  It  is  unordered,  and  multiple  occurrences  of  the  same  element 
are  permitted.  They  are  isomorphic  to  the  concept  known  as  ‘multiset’,  except  that  we 
stipulate  that  sets  are  not  bags. 

BAG-DELETE  is  an  operation  which  takes  two  arguments,  x  and  B.  Although  x  can  be 
anything,  B  must  be  a  bag.  The  procedure  is  to  remove  one  occurrence  of  x  from  B. 

BAG-DIFF  is  an  operation  which  takes  two  bags  B,C.  It  repeatedly  picks  a  member  of  C,  and 
removes  it  (one  occurrence  of  it)  from  both  B  and  C.  This  continues  until  C  is  empty. 

BAG-INSERT  is  an  operation  which  adds  (another  occurrence  of)  x  into  bag  B. 

BAG-INTERSECT  takes  two  bags  B,C,  and  creates  a  new  bag  D.  An  item  occurs  in  D  the 
minimum  number  of  times  it  occurs  in  either  B  or  C. 

BAG-UNION  takes  bag  C  and  dumps  all  its  elements  into  bag  B. 

CANONIZE  is  both  an  example  of  and  a  specialization  of  ‘Operation’.  It  accepts  two 
predicates  PI  and  P2  as  arguments,  both  defined  over  some  domain  AxA,  where  PI  is  a 
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generalization  of  P2.  Canonize  then  tries  to  produce  a  "standard  representation"  for 
elements  of  A,  in  the  following  way.  It  creates  an  operation  f  from  A  into  A,  satisfying. 
Pl(x,y)  iff  P2{f(x),f(y)).  Then  any  item  of  the  form  f(x)  is  called  a  canonical  member  of  A. 
The  set  of  such  canonical-A’s  is  worth  naming,  and  it  is  worth  investigating  the  restrictions 
of  various  operations’  domains  and  ranges  to  this  set  of  canonical-A’s®°.  "Canonize" 
contains  lots  of  information  relevant  to  creating  such  functions  f  (given  PI  and  P2).  Thus 
Canonize  is  an  example  of  the  concept  Operation.  Canonize  also  contains  information 
relevant  to  dealing  with  any  and  all  such  f’s.  So  Canonize  is  a  specialization  of  Operation. 

COALESCE  admits  the  same  duality®'.  This  very  useful  operation  takes  as  its  argument  any 
operation  F(a,b,c,d...),  locates  two  domain  components  which  intersect  (preferably,  which  are 
equal;  say  the  second  and  third),  and  then  creates  a  new  operation  G  defined  as 
G(a,b,d...)sF(a,b,b,d...).  That  is,  F  is  called  upon  with  a  pair  of  arguments  equal  to  each 
other.  If  F  were  Times,  then  G  would  be  Squaring.  If  F  were  Set-insert,  then  G  would  be 
the  operation  of  inserting  a  set  S  into  itself. 

COMPOSITION  involves  taking  two  operations  A  and  B,  and  applying  them  in  sequence: 
AoB(x)*A(B(x)).  This  concept  deals  with  (i)  the  activity  of  creating  new  compositions,  given 
a  pair  of  operations;  (ii)  all  the  operations  which  were  created  in  this  fashion.  That  is  why 
this  concept  is  both  a  specialization  of  and  an  example  of  Operation. 

CONJECTURES  are  a  kind  of  object.  This  concept  knows  about  -  and  can  store  -  conjectures. 
When  proof  techniques  are  inserted  into  AM,  this  tiny  twig  of  the  tree  of  concepts  will  grow 
to  giant  proportions. 

CONSTANT-PREDICATE  is  a  predicate  which  can  afford  to  have  a  very  liberal  domain:  it 
always  ignores  its  arguments  and  just  returns  the  same  logical  value  ail  the  time. 

DELETE  is  an  operation  which  contains  all  the  information  common  to  all  flavors  of 
removing  an  element  from  a  structure  (regardless  of  the  type  of  structure  which  is  being 
attenuated).  When  called  upon  to  actually  perform  a  deletion,  this  concept  determines  the 
type  of  structure  and  then  calls  the  appropriate  specialized  delete  concept  (e.g.,  Bag-delete). 

DIFFERENCE  is  another  general  operation,  which  accepts  two  structures,  determines  their  type 
(e.g..  Bags),  and  then  calls  the  appropriate  specialized  version  of  difference  (e.g.,  Bag-diff). 

EMPTY-STRUCTURE  contains  data  relevant  to  structures  with  no  members. 

FIRST-ELEMENT  is  an  operation  which  takes  an  ordered  structuic  and  returns  the  first 
element.  It  is  like  the  Lisp  function  ‘CAR’. 

IDENTITY  is  just  what  it  claims  to  be.  It  takes  one  argument  and  returns  it  immediately.  The 
mam  purpose  of  knowing  about  this  boring  transformation  is  just  in  case  some  new  concept 
turns  out  unexpectedly  to  be  equivalent  to  it. 


50 

51 


i«,  Itkt  *n  optralnn  which  uttd  to  have  "A’  aa  on*  of  It*  domain  componanta  or  a*  it*  rang*,  and  try  to  cr*at*  a  n*w 
oparation  with  aaaantially  th*  aama  dafmition  but  whoa*  domain/ranga  aay*  'Canonical-A*  inataad  of  ’A”. 

Both  a  apacialization  of  Oparation  and  an  axampi*  of  Oparation. 


Chapter  5 


AM:  Oitcovery  in  Mithemalkt  at  Keurittic  Search 


'109- 


insert  takes  an  item  x  and  a  structure  S,  determines  S’s  type,  and  calls  the  appropriate 
flavor  of  specialized  Insertion  concept.  The  general  INSERT  concept  contains  any 
information  common  to  all  of  those  insertion  concepts. 

INTERSECT  is  an  operation  which  computes  the  intersection  of  any  two  structures.  It,  too,  has 
a  separate  specialization  for  Bags,  Sets,  Osets,  and  Lists. 

INVERT-AN-OPERATION  is  a  very  active  concept.  It  can  invert  any  given  operation.  If 
F:X-+Y  is  an  operation,  then  its  inverse  will  be  abbreviated  F"',  and  F’'(y)  is  defined  as  all 
the  x’s  in  X  for  which  F(x)-y.  The  domain  and  range  of  F"'  are  thus  the  range  and 
domain  of  F. 

INVERTED-OP  contains  information  specific  to  operations  which  were  created  as  the  inverses 
of  more  primitive  ones. 

LAST-ELEMENT  takes  an  ordered  structure  and  returns  its  final  member. 

LIST  is  a  type  of  structure.  It  is  ordered,  and  multiple  occurrences  of  the  same  element  are 
permitted.  Lists  are  also  called  vectors,  tuples,  and  obags  (for  "ordered  bags"). 

LIST-DELETE  is  an  operation  which  takes  two  arguments,  x  and  B.  Although  x  can  be 
anything,  B  must  be  a  list.  The  procedure  is  to  remove  the  first  occurrence  of  x  from  B. 

LIST-DIFF  is  an  operation  which  takes  two  lists  B,C.  It  repeatedly  picks  a  member  of  3,  and 
removes  it  (the  first  remasning  occurrence  of  it)  from  both  B  and  C.  This  continues  until 
there  are  no  more  members  in  C. 

LIST-INSERT  is  an  operation  which  adds  (another  occurrence  of)  x  onto  the  front  of  list  B.  It 
is  like  the  Lisp  function  'CONS’. 

LIST-INTERSECT  takes  two  lists  B,C,  and  creates  a  new  list  D.  An  Item  occurs  in  D  the 
minimum  number  of  times  it  occurs  in  either  B  or  C.  D  is  arranged  In  order  as  (a  sublist 
of)  list  B. 

LIST-UNION  takes  list  C  glues  it  onto  the  end  of  list  B.  It’s  like  ‘APPEND’  in  Lisp. 

LOGICAL-RELATION  contains  knowledge  about  Boolean  combinations:  disjunction, 
conjunction,  implication,  etc. 

MULTIPLE-ELEMENTS-STRUCTURES  are  a  specialization  of  Structure.  They  permit  the  same 
atom  to  occur  more  than  once  as  a  member,  (e.g.,  Bags  and  Lists) 

NO-MULTIPLE-ELEMENTS-FTRUCTURES  are  a  specialization  of  Structure.  They  permit  the 
same  atom  to  occur  only  once  as  a  member,  (e.g..  Sets  and  Osets) 

NONEMPTY-STRUCTURES  arc  a  spctiaiiiation  of  Structure  also.  They  contain  data  about  all 
structures  which  have  some  members. 

OBJECT  is  a  general,  static  concept.  Objects  are  like  the  subjects  and  direct  objects  in 
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sentences,  while  the  Actives  are  iike  the  verbs®^. 

OBJECT-EQUALITY  is  a  predicate.  It  takes  a  pair  of  objects,  and  returns  True  if  (i)  they  are 
identical,  or  (ii)  they  are  structures,  and  each  corresponding  pair  of  members  satisfies 
Object-Equality.  Often  we’ll  call  this  ‘Equal’,  and  denote  it  as 

OPERATIONS  are  Actives  which  take  arguments  and  return  a  value.  While  a  predicate 
examines  its  arguments  and  returns  either  True  or  False,  an  operation  examines  its 
arguments  and  returns  any  number  of  values,  of  varying  types.  Assuming  that  the 
arguments  lay  in  the  domain  of  the  operation  (as  specified  by  some  entry  on  its 
Domain/range  facet),  then  every  value  returned  must  lie  within  its  range  (as  specified  by 
that  same  Domain/range  entry). 

ORDERED-PAIR  is  a  kind  of  List.  It  has  just  two  ‘slots’,  hov/ever:  a  front  and  ;i  rear  element. 

ORDERED-STRUCTURE  is  a  specialized  type  of  Structure.  It  includes  all  structures  for  which 
the  order  of  insertion  of  two  members  can  make  a  difference  in  whether  the  structures  are 
equal  or  not.  Ordered-structures  are  those  for  which  it  makes  sense  to  talk  about  a  front 
and  a  rear,  a  first  element  and  a  last  element. 

OSET  is  a  type  of  structure.  It  is  ordered,  and  multiple  occurrences  of  the  same  element  are 
not  permitted  The  short-term-memory  of  Newell’s  PSG  [Newell  73)  is  an  Oset,  as  is  a 
cafeteria  line.  Not  much  use  was  found  for  this  concept  by  AM. 

OSET-DELETE  remove  x  from  oset  B  (if  x  was  in  B). 

OSET-DIFF  IS  an  operation  which  takes  two  osets  B,C.  It  removes  each  member  of  C  from  B. 

OSET-INSERT  is  an  operation  which  adds  x  to  the  front  of  oset  B.  If  x  was  in  B  previously, 
it  is  simply  moved  to  the  front  of  B. 

OSET-INTERSECT  takes  two  osets  B,C,  and  removes  from  B  any  items  which  are  not  in  C  as 
well.  B  thus  ‘induces’  the  ordering  on  the  resultant  oset. 

OSET-UNION  takes  oset  C,  removes  any  elements  in  B  already,  then  glues  what’s  left  of  C 
onto  the  rear  of  B. 

PARALLEL- JOIN  is  an  operation  which  takes  a  kind  of  structure  and  an  operation  H.  It 
creates  a  new  operation  F,  whose  domain  is  that  type  of  structure.  For  any  such  structure  S, 
F(S)  is  computed  by  appending  together  H(x)  for  each  member  x  of  S. 

PARALLEL-J0IN2  is  a  similar  operation.  It  creates  an  operation  F  with  two  structural 
arguments.  F(S,L)  is  computed  by  appending  the  values  of  H(x,L),  as  x  runs  through  the 
elements  of  S.®® 
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PARALLEL-REPLACE  is  an  operation  used  to  synthesize  new  substitution  operations.  It  takes 
a  structural  type  and  an  operation  H  as  its  arguments,  and  creates  a  new  operation  F.  F(S) 
is  computed  by  simply  replacing  each  member  x  of  S  by  the  value  of  F(x).  The  operation 
produced  is  very  much  like  the  Lisp  function  MAPCAR. 

PARALLEL-REPLACE2  is  a  slightly  more  general  operation.  It  creates  F,  where  F(S,L)  is 
computed  by  replacing  each  x<S  by  F(x,L). 

PREDICATES  are  actives  which  examine  their  arguments  and  then  return  T  or  NIL  (True  or 
False).  It  is  only  due  to  the  capriciousness  of  AM's  initial  design  that  predicates  are  kept 
distinct  from  operations.  Of  course,  each  example  of  an  operation  can  be  viewed  as  if  it 
were  a  predicate;  if  F;A-*B  is  any  operation  from  A  to  B,  then  we  can  consider  F  a  relation 
on  AxB,  that  Is  a  subset  of  AxB,  and  from  there  pass  to  viewing  F  as  a  (characteristic) 
predicate  F:AxB-+{T,F}.  Similarly,  any  predicate  on  Ax...xBxC  may  be  considered  an 
operation  (a  multi-valued,  not-always-defined  function)  from  Ax...xB  into  C.  There  are  no 
unary  predicates.  If  there  were  one,  say  P:A-+{T,F},  then  that  predicate  would  essentially  be 
a  new  way  to  view  a  certain  subset  of  A;  the  predicate  would  then  be  transformed  into 
{a<A|P(a)},  made  into  a  new  concept,  tagged  as  a  specialization  of  A,  and  its  definition 
would  be  [A.Defn(a)  a  P(a)]'*. 

PROJECTION!  is  a  simple  operation.  It  is  defined  as  \  (x  y)  x.  Notice  that  Identity  is  just  a 
specialized  restriction  of  Projl.  ProJl(Me.You)«Me. 

PR0JECTI0N2  js  a  similar  operation.  It  is  defined  as  \  (x  y)  y. 

RELATION  is  ati.  Active  which  has  been  encapsulated  into  a  set  of  ordered  pairs.  'Relation' 
bridges  the  gap  between  active  and  static  concepts. 

REPEAT  is  an  operation  for  generating  new  operations  by  repeating  old  ones.  Given  as  its 
argument  a  structural  type  SS  and  an  existing  operation  H  (with  domain  and  range  of  the 
form  SSxSS-»SS),  Repeat(SS,H)  synthesizes  a  brand  new  operation  F.  The  domain/range  of 
F  is  Just  that  of  H.  F(S)  is  computed  by  repeating  TEMP4-H(x,TEMP)  for  each  element  x 
of  S.  TEMP  is  initialized  as  some  member  (preferably  the  first  element)  of  S. 

REPEAT2  is  similar,  but  requires  that  H  take  three  arguments,  and  It  creates  F,  where  F(S,L) 
is  gotten  by  repeatedly  doing  TEMP«-H(x,TEMP,L). 

RESTRICT  is  an  operation  which  turns  out  new  operations.  Given  an  argument  operation  (or 
predicate)  F,  the  synthesized  concept  would  have  the  same  definition  as  F,  but  would  have 
its  domain  and/or  range  curtailed. 

REVERSE-ORDERED-PAIR  transforms  the  ordered  pair  <x,y>  into  <y,x>. 

SET  is  a  type  of  structure.  It  is  unordered,  and  multiple  occurrences  of  the  same  element  are 
not  permitted. 

SET-DELETE  is  an  operation  which  takes  two  arguments,  x  and  B.  Although  x  can  be 
anything,  B  must  be  a  set.  The  procedure  Is  to  remove  x  from  B  (if  x  was  in  B),  then 
icturn  th**  resultant  value  of  B. 
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SET-DIFF  Is  an  operation  which  ta^es  two  sets  B,C.  It  removes  each  member  of  C  from  B. 

SET-INSERT  is  an  operation  which  adds  x  to  set  B. 

SET-INTERSECT  removes  from  set  B  any  items  which  are  not  in  set  C,  too. 

SET-UNION  dumps  into  B  ail  the  members  of  C  which  weren’t  in  there  already. 

STRUCTURE,  the  antithesis  of  ATOM,  is  inherently  divisible.  A  structure  is  something  that 
has  members,  that  can  be  broken  into  pieces.  There  are  two  questions  one  can  ask  about 
any  kind  of  structure:  Is  it  ordered  or  not?  Can  there  be  multiple  occurrences  of  the  same 
element  in  it  or  not?  There  are  four  sets  of  answers  to  these  two  questions,  and  each  of  the 
four  specifies  a  weli-kncwn  kind  of  structure  (Sets,  Lists,  Osets,  Bags). 

STRUCTURE-OF-STRUCTURES  is  a  specialization  of  Structure,  representing  those  structures  all 
of  whose  membeis  are  themselves  structures. 

TRUTH-VALUE  is  a  specialized  kind  of  atomic  object.  Its  only  examples  are  True  and  False. 
This  concept  is  the  range  set  for  all  predicates. 

UNION  is  a  general  kind  of  Joining  operation.  It  takes  two  structures  and  combines  them. 
Four  separate  variants  of  this  concept  are  given  to  AM  initially  (e.g..  Set-union). 

UNORDERED-STRUCTURE  is  a  specialized  type  of  Structure.  It  includes  all  structures  for 
which  the  order  of  insertion  of  two  members  never  makes  any  difference  in  whether  the 
structures  are  equal  or  not.  Unordered-structures  cannot  be  said  to  have  a  front  or  a  rear,  a 
first  element  or  a  last  element. 


A  necessary  part  of  realizing  AM  was  to  choose  a  particular  set  of  starting  concepts.  But 
how  should  such  a  choice  be  made? 


My  first  impulse  was  to  gather  a  complete  set  of  concepts.  That  is,  a  basis  which  would  be 
sufficient  to  derive  all  mathematics.  The  longer  1  studied  this,  the  larger  the  estimated  size 
of  this  basis  grew.  It  immediately  became  clear  that  this  would  never  fit  in  256k.  One 
philosophical  problem  here  is  that  future  mathematics  may  be  inspired  by  some  real-world 
phenomena  which  haven’t  even  been  observed  yet.  Aliens  visiting  Earth  might  have  a 
different  mathematics  from  ours,  since  their  collective  life  experiences  could  be  quite 
different  from  we  Terrans. 

Scrapping  the  idea  of  a  sufficient  basis,  what  about  a  necessary  one?  That  is,  a  basis  which 
would  be  minimal  in  the  following  sense:  if  you  ever  removed  a  concept  from  that  basis,  it 
could  never  be  re-discovered.  In  isolated  cases,  one  can  tell  when  a  basis  is  not  minimal:  if 
it  contains  both  addition  and  multiplication,  then  it  is  too  rich,  since  the  latter  can  be 
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derived  from  the  former.^®  ^nd  yet,  the  same  problem  about  "absoluteness"  cropped  up: 
how  can  anyone  claim  tha'  t'le  discovery  of  X  can  never  be  made  from  a  given  starting 
point?  Until  recently,  mathemaf'cians  didn’t  realize  how  natural  it  was  to  derive  numbers 
and  arithmetic  from  set  theory  task  which  AM  does,  by  the  way)^®.  So  50  years  ago  the 
concepts  of  set  theory  and  number  theory  would  both  have  been  undisputedly  placed  into  a 
"minimal"  basis.  There  are  thus  no  absolute  conceptual  primitives;  each  culture  (perhaps 
even  each  individual)  possesses  its  own  basis. 

Since  I  couldn’t  give  AM  a  minimal  basis,  nor  a  complete  one,  I  decided  AM  might  as  well 
have  a  nice  one.  Although  it  can  never  be  minimal,  it  should  nevertheless  be  made  very 
small  (order  of  magnitude:  100  concepts).  Although  it  can  never  be  complete,  it  should 
suffice  for  re-discovering  muci  of  already-known  mathematics.  Finally,  it  should  be 
rational,  by  which  I  mean  that  there  should  be  a  simple  rule  for  deciding  which  concepts  do 
and  don’t  belong  in  that  basis. 

The  concepts  AM  starts  with  are  meant  to  be  those  possessed  by  young  children  (age  4,  say). 
This  explains  some  omissions  of  concepts  which  would  otherwise  be  considered 
fundamental:  (i)  Proof  and  techniques  for  proof/disproof;  (ii)  Abstract  properties  of 
relations,  like  associativity,  single-valued,  onto;  (iii)  Cardinality,  arithmetic;  (iv)  Infinity, 
continuity,  limits.  The  interested  reader  should  see  [Piaget  55]  or  [Copeland  70]. 

Because  my  programming  time  and  the  PDP-lO’s  memory  space  were  both  quite  small,  only 
a  small  percentage  of  these  'pre-numerical’  concepts  could  be  included.  Some  unjustified 
omissions  are:  (i)  visual  operations,  like  rotation,  coloration;  (ii)  Games,  rules,  procedures, 
strategies,  tactics;  (iii)  Geometric  notions,  e.g.,  outside  and  between. 

A  M  is  not  supposed  to  be  a  model  of  a  child,  however.  It  was  never  my  intention  (and  it 
would  be  much  too  hard  for  me)  to  try  to  emulate  a  human  child’s  whimsical  imagination 
and  emotive  drives.  And  AM  is  not  ripe  for  "teaching",  as  are  children.®’^  Also,  though  it 
possesses  a  child’s  ignorance  of  most  concepts,  AM  is  given  a  large  body  of  sophisticated 
"adult"  heuristics.  So  perhaps  a  more  faithful  image  is  that  of  Ramanujan,  a  brilliant 
modern  mathematician  who  received  a  very  poor  education,  and  was  forced  to  re-derive 
much  of  known  number  theory  all  by  himself.  Incidentally,  Ramanujan  never  did  master 
the  concept  of  formal  proof. 

There  is  no  formJ  justification  for  the  particular  set  of  starting  concepts.  They  are  all 
reasonably  primitive  (sets,  composition),  and  lie  several  levels  "below"  the  ones  which  .Am 
managed  to  ultimately  derive  (prime  factorization,  square-root),  h  rr-ighc  De  valuable  to 
attempt  a  similar  automated  math  discoverer,  whi'-h  began  with  a  very  different  set  of 
concepts  (eg.,  start  it  out  as  an  pvpcrl  in  lattice  theory,  possessing  all  known  concepts 
thereof).  The  converge  kind  of  experiments  are  to  vary  the  initial  base  of  concepts,  and 
observp  the  ertects  on  AM’s  behavior.  A  few  experiments  of  that  form  are  described  in 
bection  6.2. 
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The  "new  math"  la  trying  to  gat  young  childran  to  do  thit  at  wall,  unfortunataly,  no  ona  thowad  the  elementary-tchool 
taachern  the  underlying  harmony,  and  the  raaulta  have  bean  saddening 

Learning  ptychologiatt  might  label  AM  at  nao-bahaviorittic  and  cognitiviatic.  See  [LtFrancoit] 


Chapter  6.  Results 


This  chapter  opens  by  summarizing  what  AM  "did".  Section  1  gives  a.  fairly  high-level 
description  of  the  major  paths  which  were  explored,  the  concepts  discovered  along  the  way, 
the  relationships  which  were  noticed,  and  occasionally  the  ones  which  "should"  have  been 
but  but  weren’t. 

The  next  section  (6.2)  continu.’s  this  exposition  by  presenting  the  results  of  experiments 
which  were  done  with  iand  on)  AM. 

Chapter  7  will  draw  upon  these  results  -  and  others  given  in  the  appendices  -  to  form 
conclusions  about  AM.  Several  meta-level  questions  will  be  tackled  there  (e.g,,  "What  are 
AM’s  limitations?"). 


Now  we  have  seen  (hat  mathematical  work  is  not  simply  mechanical,  that  it  could 
not  be  done  by  a  machine,  however  perfect.  It  is  not  merely  a  question  of  applying 
rules,  of  making  the  most  combinations  possible  according  to  certain  fixed  laws. 
The  combinations  so  obtained  would  be  exceedingly  numerous,  useless,  and 
cumbersome.  The  true  work  of  the  inventor  consists  in  choosing  among  these 
combinations  so  as  to  eliminate  the  useless  ones  or  rather  to  avoid  the  trouble  of 
making  them,  and  the  rules  which  must  guide  this  choice  are  extremely  fine  and 
delicate.  It  is  almost  impossible  to  state  them  precisely;  they  are  felt  rather  than 
formulated.  Under  these  conditions,  how  imagine  a  sieve  capable  of  applying 
them  mechanically? 

Poincare’ 


AM  is  both  a  mathematician  of  sorts,  and  a  big  computer  program. 

By  granting  AM  more  anthropomorphic  qualities  than  it  deserves,  we  can  describe  its 
progress  through  elementary  mathematics.  It  rediscovered  many  well-known  concepts,  a 
couple  interesting  but  not-generally-known  ones,  and  several  concepts  which  were  hitherto 
unknown  and  should  have  stayed  that  way.  Section  1.3,  on  page  10,  recaps  what  AM  did, 
much  as  a  historian  might  critically  evaluate  Euler’s  work.  A  more  detailed  prose 
description  of  everything  AM  did  is  found  in  Appendix  5.1,  beginning  on  page  287. 


Instead  of  repeating  any  of  this  descriptive  prose  here,  Section  6.1.!  will  provide  a  very 
brief  listing  of  what  AM  did  in  a  single  good  run,  task  by  task.  A  much  more  detailed 
version  of  this  same  list  is  found  in  Appendix  5.2,  beginning  on  page  294.  The  ta.sk 
numbers  there  correspond  to  the  numbering  below'.  These  task-by-task  listings  are  not 
complete  listings  of  every  task  AM  ever  attempted  in  any  of  its  many  runs,  but  rather  a 
trace  of  a  single,  better-than-average  run  of  the  program.®  The  reader  may  wish  to  consult 
the  brief  alphabetized  glossary  of  concept  names  in  the  last  chapter  (page  107),  or  the  more 
detailed  appendix  of  concept  descriptions  (following  page  173). 

Following  this  linear  trace  of  AM’s  behavior  is  a  more  appropriate  representation  of  what  it 
did:  namely,  a  two-dimensional  graph  of  that  same  behavior  as  seen  in  "concept-space". 
This  forms  Section  6.1.2,  and  is  found  on  page  123. 

By  under-estimating  AM’s  sophistication,  one  can  demand  answers  to  the  typical  questions 
to  ask  about  a  computer  program:  how  big  is  it,  how  much  cpu  time  does  It  use,  what 
language  it’s  coded  in,  etc.  These  arc  found  in  Section  6.1.3. 


6.1.1.  Linear  Task-bv»task  Summary  of  a  Good  Run 


1.  Fill  in  examples  of  Compose.  Failed,  but  suggested  next  task: 

2.  Fill  in  examples  of  Set-union.  Also  failed,  but  suggested: 

3.  Fill  in  examples  of  Sets.  Many  found  (e.g.,  by  instantiating  Set.Defn)  and  then  more 

derived  from  those  examples  (e.g.,  by  running  Union.Alg). 

4.  Fill  in  specializations  of  Sets  (because  it  was  very  easy  to  find  examples  of  Sets). 

Creation  of  new  concepts.  One,  INT-Sets,  is  related  to  "Singletons".  Another,  "Bl- 
Sets",  is  all  nests  of  braces  (no  atomic  elements). 

5.  Fill  in  examples  of  INT-Sets.  This  indirectly  led  to  a  rise  in  the  worth  of  Equal. 

6.  Check  all  examples  of  INT-Sets.  AH  were  confirmed.  AM  defines  the  set  of  Nonempty 

INT-Sets;  this  is  renamed  "Singletons"  by  the  user. 

7.  Check  all  examples  of  Sets.  To  check  a  couple  conjectures,  AM  will  soon  look  for 

Bags  and  Osets. 

8.  Fill  in  examples  of  Bags. 

9.  Fill  in  specializations  of  Bags.  Created  INT-Bags  (contain  just  one  kind  of  element), 

and  BI-Bags  (nests  of  parentheses). 

10.  Fill  in  examples  of  Osets. 

11.  Check  examples  of  Osets. 

12.  Fill  in  examples  of  Lists. 

13.  Check  examples  of  List.*;. 

14.  Fill  in  examples  of  All-but-first. 

15.  Fill  in  examples  of  All-but-last. 

16.  Fill  in  specializations  of  All-but-last.  Failed. 


They  do  flOt  procitoly  tn«teh  th«  t«tk  numbirt  iccomptnyinf  th«  txamplo  (ivtn  m  Chaptor  2. 

In  fact,  H  i«  ptrhapt  th«  bast  ovtrall  run.  It  occurred  in  two  ataftt  (dut  to  tpact  problatnt;  unimportant).  In  this  particular 
run,  AM  miataa  the  ftw  ’vary  boat"  diacovtritt  it  avar  made,  ainca  tha  runa  thay  occurred  in  want  in 
aomawhat  diffarant  diractiona.  It  alao  omita  aoma  of  tha  more  bonne  taaka  aaa,  a  tha  daacription  of  taak 
number  69. 
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17.  Fill  in  examples  of  List-union. 

18.  Fill  in  examples  of  Projl. 

19.  Check  examples  of  All-but-first. 

20.  Check  examples  of  All-but-last. 

21.  Fill  in  examples  of  Proj2. 

22.  Fill  in  examples  of  Empty-structures.  4  found. 

23.  Fill  in  generalizations  of  Empty-structures.  Failed. 

24.  Check  examples  of  List-union. 

25.  Check  examples  of  Bags.  Defined  Singleton-bags. 

26.  Fill  in  examples  of  Bag-union. 

27.  Check  examples  of  Proj2. 

28.  Fill  in  examples  of  Set-union. 

29.  Check  examples  of  Set-union.  Define  A  (x,y)  xuy«x,  later  called  Superset. 

30.  Fill  in  examples  of  Bag-insert. 

31.  Check  examples  of  Bag-insert.  Range  is  realty  Nonempty  bags.  Isolate  the  results  of 

insertion  restricted  to  Singletons:  call  them  Doubleton-bags. 

32.  Fill  in  examples  of  Bag-intersect 

33.  Fill  in  examples  of  Set-insert. 

34.  Check  examples  of  Set-insert.  Range  is  always  Nonempty  sets.  Define  X  (x,S)  Set- 

insert(x,S)=S;  i.c.,  set  membership.  Define  Doubleton  sets. 

35.  Fill  in  examples  of  Bag-delete. 

36.  Fill  in  examples  of  Bag-difference. 

37.  Check  examples  of  Bag-intersect.  Define  A  (x,y)  xny*();  i.e.  disjoint  bags. 

38.  Fill  in  examples  of  Set-intersect. 

39.  Check  examples  of  Set-intersect.  Define  A  (x,y)  xny-x;  i.e.,  subset.  Also  define 
disjoint  sets:  A  (x,y)  xny«{}. 

40.  Fill  in  examples  of  List-intersect. 

41.  Fill  in  examples  of  Equal.  Very  difficult  to  find  examples;  this  led  to: 

42.  Fill  in  generalizations  of  Equal.  Define  "Same-size",  "Equal-CARs",  and  some  losers. 

43.  Fill  in  examples  of  Same-size. 

44.  Apply  an  Algorithm  for  Canonize  to  the  args  Same-size  and  Equal.  AM  eventually 

synthesizes  the  canonizing  function  "Size".  AM  defines  the  set  of  canonical 
structures:  bags  of  T’s;  this  later  gets  renamed  as  "Numbers". 

45.  Restrict  the  domain/range  of  Bag-union.  A  new  operation  is  defined,  Number- 
union,  with  domain/range  entry  <Number  Number  -♦  Bag>. 

46.  Fill  in  examples  of  Number-union.  Many  found. 

47.  Check  the  domain/range  of  Number-union.  Range  is  'Number*.  This  operation  is 

renamed  "Add2". 

48.  Restrict  the  domain/range  of  Bag-intersect  to  Numbers.  Renamed  "Minimum". 

49.  Restrict  the  domain/range  of  Bag-delete  to  Numbers.  Renamed  "SUBl". 

50.  Restrict  the  domain/range  of  Bag-insert  to  Numbers.  AM  calls  the  new  operation 

"Number-insert".  Its  domain/range  entry  is  <Anything  Number  -♦  Bag>. 

51.  Check  the  domain/range  of  Number-insert.  This  doesn’t  lead  anywhere. 

52.  Restrict  the  domain/range  of  Bag-difference  to  Numbers.  This  becomes  "Subtract". 

53.  Fill  in  examples  of  Subtract.  This  leads  to  defining  the  relation  LEQ,($).® 

54.  Fill  In  examples  of  LEQ.  Many  found. 
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55.  Check  examples  of  LEQ. 

56.  Apply  algorithm  of  Coalesce  to  LEQ.  LEQ(x,x)  is  Constant-True. 

57.  Fill  in  examples  of  Parallel- join2.  Included  is  Parallel- join2(Bags, Bags, Pro j2),  which 

is  renamed  "TIMES",  and  ParalleI-join2{Structures,Structures,Projl),  a  generalized 
Union  operation  renamed  "G-Union",  and  a  bunch  of  losers. 

58.  -  69.  Fill  in  and  check  examples  of  the  operations  just  created. 

70.  Fill  in  examples  of  Coalesce.  Created:  Self-Compo.se,  Self-Insert,  Self-Delete,  Self- 
Add,  Self-Times,  Self-Union,  etc.  Also;  Coa-repeat2,  Coa-Join2,  etc. 

71.  Fill  in  examples  of  Self-Delete.  Many  found. 

72.  Check  examples  of  Self-Delcte.  Self-Delete  is  just  Identity-op. 

73.  Fill  in  examples  of  Self-Member.  No  positive  examples  found. 

74.  Check  examples  of  Self-Member.  Self-member  is  just  Constant-False. 

75.  Fill  in  examples  of  Self-Add.  Many  found  User  renames  this  "Doubling". 

76.  Check  examples  of  Coalesce.  Confirmed. 

77.  Check  examples  of  Add2.  Confirmed. 

78.  Fill  in  examples  of  Self-Times.  Many  found.  Renamed  "Squaring"  by  the  user. 

79.  Fill  in  examples  of  Self-Composc.  Defined  SquaringoSquaring.  Created  AddoAdd 
(two  versions:  Add21  which  is  >,  (x,y,z)  (x+y)+z,  and  Add22  which  is  x+(y*2)). 
Similarly,  two  versions  of  TimesoTimes  and  of  ComposeoCompose. 

80.  Fill  in  examples  of  Add21.  {x+y)+z.  Many  are  found. 

81.  Fill  in  examples  of  Add22.  x+(y+z).  Again  many  are  found. 

82.  Check  examples  of  Squaring.  Confirmed. 

83.  Check  examples  of  Add22.  Add21  and  Add22  appear  equivalent.  But  first: 

84.  Check  examples  of  Add21.  Add21  and  Add22  still  appear  equivalent.  Merge  them. 

So  the  proper  argument  for  a  generalized  "Add"  operation  is  a  Bag. 

85.  Apply  algorithm  for  Invert  to  argument  ‘Add’.  Define  Inv-add(x)  as  the  set  of  all 

bags  of  numbers  (>0)  whose  sum  is  x.  Also  denoted  Add*'(x). 

86.  Fill  in  examples  of  TIMES21.  (xy)z.  Many  are  found. 

87.  Fill  in  examples  of  TIMES22.  x(yz).  Again  many  are  found. 

88.  Check  examples  of  TIMES22.  TIMES2I  and  TIMES22  may  be  equivalent. 

89.  Check  examples  of  TIMES21.  TIMES21  and  TIMES22  still  appear  equivalent. 

Merge  them.  So  the  proper  argument  for  a  generalized  "TIMES"  operation  is  a 
Bag.  Set  up  an  analogy  between  TIMES  and  ADD,  because  of  this  fact. 

90.  Apply  algorithm  for  Invert  to  argument  TIMES’.  Define  Inv-TIMES(x)  as  the  set 

of  all  bags  of  numbers  (>1)  whose  product  is  x.  Analogic  to  Inv-Add. 

91.  Fill  in  examples  of  Parallel-replace2.  Included  are  Parall^l- 
replace2(Bags,Bags,Proj2)  (called  MR2-BBP2),  and  many  losers. 

92.  -  107.  Fill  in  and  check  examples  of  the  operations  just  created. 

108.  Fill  in  examples  of  Compose.  So  easy  that  AM  creates  Int-Compose. 

109.  Fill  in  examples  of  Int-Compose.  The  two  chosen  operations  G,H  must  be  such 
that  ran(H)<dom(G),  and  ran(G)<dom(H);  both  G  and  H  must  be  interesting. 
Create  G-UnionoMR2-BBP2,^  InsertoDelete,  TimesoSquaring,  etc. 

1 10.  -  127.  Fill  in  and  check  examples  of  the  compositions  just  created.  Notice  that  G- 
UnionoMR2-BBP2  is  just  TIMES. 

128.  Fill  in  examples  of  Coa-repeat2.  Among  them:  Coa-repeat2(Bags-of-Numbers, 
Add2)  [multiplication  again!],  Coa-repeat2(Bags-cf-Numbcrs,  Times) 
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texponentiation],  Coa-repeat2(Structures,  Projl)  [CAR],  Coa-repeat2(Structures, 
Proj2)  [Last-element-of],  etc. 

129.  Check  the  examples  of  Coa-repeat2.  All  confirmed. 

130.  Apply  algorithms  for  Invert  to  ‘Doubling’.  The  re«u!t  is  caiisd  "Halving"  by  the 
user.  AM  then  defines  "Evens" 

131.  Fill  in  examples  of  Self-Insert. 

132.  Check  examples  of  Self-Insert.  Nothing  special  found. 

133.  Fill  in  examples  of  Coa-repeat2-Add2. 

134.  Check  examples  of  Coa-repeat2-Add2.  It’s  the  same  as  TIMES. 

135.  Apply  algorithm  for  Invert  to  argument  ’Squaring’.  Define  "Square-root". 

136.  Fill  in  examples  of  Square-root.  Some  found,  but  very  inefficiently. 

137.  Fill  in  new  algorithms  for  Square-root.  Had  to  ask  user  for  a  good  one. 

138.  Check  examples  of  Square-root.  Define  the  set  of  numbers  "Perfect-squares". 

139.  Fill  in  examples  of  Coa-repeat2-Times.  This  is  exponentiation. 

140.  Check  examples  of  Coa-repeat2-Times.  Nothing  special  noticed,  unfortunately. 

141.  Fill  in  examples  of  Inv-TIMES.  Many  found,  but  inefficiently. 

142.  Fill  in  new  algorithms  for  Inv-TIMES.  Obtained  opaquely  from  the  user. 

143.  Check  examples  of  Inv-TIMES.  This  task  suggests  the  next  one: 

144.  Compose  G-Union  with  Inv-TIMES.  Good  domain/range.  Renamed  "Divisors". 

145.  Fill  in  example,'  of  Divisors.  Many  found,  but  not  very  efficiently. 

146.  Fill  in  new  algorithms  for  Divisors.  Obtained  from  the  use*". 

147.  Fill  in  examples  of  Perfect-squares.  Many  found. 

148.  Fill  in  specializations  of  TIMES.  Timesl(x)sl*x,  Times2(x)52x,  Times-sq  is  TIMES 
with  its  domain  restricted  to  bags  of  perfect  squares,  Times-ev  takes  only  even 
arguments,  Times-to-evens  requires  that  the  result  be  even,  Times-to-sq, ... 

149.  Check  examples  of  Divisors.  Define  0-Div,  1-Div,  2-Div,  and  3-Div,  the  sets  of 
numbers  whose  Divisors  value  is  the  empty  set,  a  singleton,  a  doubleton,  and  a 
tripleton,  respectively. 

150.  Fill  in  examples  of  1-Div.  Only  one  example  found:  "I".  Lower  1-Div.Worth. 

151.  Fill  in  examples  of  0-Div.  None  found.  Lower  the  worth  of  this  concept. 

152.  Fill  in  examples  of  2-Div.  A  nice  number  are  found.  Raise  2-Dlv.Wcrth. 

153.  Check  examples  of  2-Div.  All  confirmed,  but  no  pattern  noticed.  • 

154.  Fill  in  examples  of  3-Div.  A  nice  number  found. 

155.  Check  examples  of  3-Div.  All  confirmed.  All  are  perfect  squares. 

156.  Restrict  Square-root  to  numbers  which  are  in  3-Div.  Call  this  Root3. 

157.  Fill  in  examples  of  Root3.  Many  found. 

158.  Check  examples  of  Root3.  All  confirmed.  All  are  in  2-Div.  Raise  their  worths. 

159.  Restrict  Squaring  to  2-divs.  Call  the  result  Square2. 

160.  Fill  in  examples  of  Square2.  Many  found. 

161.  Check  the  range  of  Square2.  Always  3-Divs.  Conjecture:  x  has  2  divisors  iff 
has  3  divisors. 

162.  Restrict  Squaring  to  3-Divs.  Call  the  result  Square3. 

•63.  Restrict  Square-rooting  to  2-Divs.  Call  the  result  Root2. 

164.  Fill  in  examples  of  Square3.  Many  found. 

165.  Compose  Divisors-of  and  Square!  Call  the  result  Div-Sq3. 

166.  Fill  in  examples  of  Div-Sq3.  Many  found. 

167.  Check  examples  of  Div-Sq3.  All  such  examples  are  Same-size. 

168.  -  175.  More  confirmations  and  explorations  of  the  above  conjecture.  Gradually, 
all  its  ramifications  lead  to  dead-ends  (as  far  as  AM  is  concerned). 

176.  Fill  in  examples  of  Root2.  None  found.  Conjecture  that  there  arc  none. 
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177.  Check  examples  of  Inv-TIMES.  Inv-TIMES  always  contains  a  singleton  bag,  and 
always  contains  a  bag  of  primes. 

178.  Restrict  the  range  of  Inv-TIMES  to  bags  of  primes.  Call  this  Prime-Times. 

179.  Restrict  the  range  of  Inv-TIMES  to  singletons.  Called  Single-Times. 

180.  Fill  in  examples  of  Prime-times.  Many  found. 

181.  Check  examples  of  Prime-times.  Always  a  singleton  set.  User  renames  this 
conjecture  "The  unique  factorization  theorem". 

182.  Fill  in  examples  of  Single-TIMES.  Many  found. 

183.  Check  examples  of  Single-TIMES.  Always  a  singleton  set.  Single-TIMES  is 
actually  the  same  as  Bag-insert! 

184.  Fill  in  examples  of  Self-set-union.  Many  found. 

185.  Check  examples  of  Self-set-union.  This  operation  is  same  as  Identity. 

186.  Fill  in  examples  of  Self-bag-union.  Many  found. 

187.  Check  examples  of  Self-bag-union.  Confirmed.  Nothing  Interesting  noticed. 

188.  Fill  in  examples  of  Inv-ADD. 

189.  Check  examples  of  Inv-ADD.  Hordes  of  boring  conjectures,  so: 

190.  Restrict  the  domain  of  Inv-ADD  to  primes  (Inv-Add-primes),  to  evens  (Inv-Add- 
evens),  to  squares,  etc. 

191.  Fill  in  examples  of  Inv-add-primes.  Many  found. 

192.  Check  examples  of  Inv-add-primes.  Confirmed,  but  nothing  special  noticed. 

193.  Fill  in  examples  of  Inv-add-evens.  Many  found. 

194.  Check  examples  of  Inv-add-evens.  Always  contains  a  bag  of  primes. 

195.  Restrict  the  range  of  Inv-Add-evens  to  bags  of  primes.  Called  Prime-ADD. 

196.  Restrict  the  range  of  Inv-ADD  to  singletons.  Call  that  new  operation  Single-ADD. 

197.  Fill  in  examples  of  Prime-ADD.  Many  found. 

198.  Check  examples  of  Prime-ADD.  Always  a  nonempty  set  (of  bags  of  primes).  User 
renames  this  conjecture  "Goldbach’s  conjecture". 

199.  Fill  in  examples  of  Single-ADD.  Many  found. 

200.  Check  examples  of  Single-ADD.  Always  a  singleton  set.  This  operation  is  the  same 
as  Bag-Insert  and  Single-TIMES. 

201.  Restrict  the  range  of  Prime-ADD  to  singletons,  by  analogy  to  Prime-TIMES.®  Call 
the  new  operation  Primc-ADD-SING. 

202.  Fill  in  examples  of  Prime-ADD-SING.  Many  found. 

203.  Check  examples  of  Prime-ADD-SING.  Nothing  special  noticed. 

204.  Fill  in  examples  of  Times-sq.®  Many  examples  found. 

205.  Check  domain/range  of  Times-sq.  Is  the  range  actually  Perfect-squares?  Yes! 

206.  Fill  in  examples  of  Timesl.  Recall  that  Timcsl(x)*TIMES(l,x). 

207.  Check  examples  of  Timesl.  Apparently  just  a  restriction  of  Identity. 

208.  Check  examples  of  Times-sq.  Confirmed. 

209.  Fill  in  examples  of  TimesO. 

210.  Fill  in  examples  of  Times2. 

211.  Check  examples  of  Times2.  Apparently  the  same  as  Doubling.  That  is,  x+x=2»x. 
Very  important.  By  analogy,  define  Aq2(x)  as  x+2. 

212.  Fill  in  examples  of  Ad2. 

213.  Check  examples  of  Ad2.  Nothing  interesting  noticed. 
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214.  Fill  in  specializations  of  Add.  Among  those  created  are:  AddO  (x+0),  Addl,  Add3, 
ADD-sq  (addition  restricted  to  perfect  squares),  Add-ev  (sum  of  even  numbers), 
Add-pr  (sum  of  primes),  etc. 

215.  Check  examples  of  TimesO.  The  value  always  seems  to  be  0. 

216.  Fill  in  examples  of  Times-ev.^  Many  examples  found. 

217.  Check  examples  of  Times-ev.  Apparently  all  the  results  are  Evens. 

218.  Fill  in  examples  of  Times-to-ev.®  Many  found. 

219.  Fill  in  examples  of  Times-to-sq.  Only  a  few  found. 

220.  Check  examples  of  Times-to-sq.  All  arguments  always  seem  to  be  squares.  Conjee: 
Times-to-sq  is  really  the  same  as  Times-sq.  Merge  the  two.  This  is  a  false 
conjecture,  but  did  AM  no  harm. 

,  221.  Check  examples  of  Times-to-cv.  The  domain  always  contains  an  even  number. 

222.  Fill  in  examples  of  Self-Union. 

223.  Check  examples  of  Self-Union. 

224.  Fill  in  examples  of  SubSet. 

225.  Check  example  of  SubSet. 

226.  Fill  in  examples  of  SuperSet. 

227.  Check  examples  of  SuperSet.  Conjee:  Subset(x,y)  iff  Superset(y,x).  Important. 

228.  Fill  in  examples  of  ComposeoCompose-1.  AM  creates  some  explosive  combination 
(e.g.,  (ComposeoCompose)o(ComposeoCompose)o(ComposeoComposc)),  some  poor 
ones  (e.g.,  SquareoCountoADD"'),  and  even  a  few  -  very  few  -  winners  (e.g., 
SUB  loCountoSelf-lnsert). 

229.  Check  examples  of  ComposeoCompose-1. 

230.  Fill  in  examples  of  ComposeoCompose-2.®  AM  recreates  many  of  the  previous 
tasks*  operations. 

231.  Check  examples  of  ComposeoCompose-2.  Nothing  noticed  yet'®. 

232.  -  252.  Fill  in  and  check  examples  of  the  losing  compositions  just  created. 

253.  Fill  in  examples  of  Add-sq  (i.e.,  sum  of  squares). 

254.  Check  domain/range  entries  of  Add-sq.  The  range  is  not  always  perfect  squares. 
Define  Add-sq-sq(x,y),  which  is  True  iff  x  and  y  are  perfect  squares  and  their  sum 
is  a  perfect  square  as  well. 

255.  Fill  in  examples  of  Add-pr;  i.e.,  addition  of  primes. 

256.  Check  Domain/range  entries  of  Add-pr.  AM  defines  the  set  of  pairs  of  primes 
whose  sum  is  also  a  prime.  This  is  a  bizarre  derivation  of  prime  pairs. 


Rectll  that  Timt$-tv  ia  just  like  TIMES  rcatrictid  to  optrating  on  tvtn  numbtrt. 

o 

That  it,  contidtr  bagt  of  numbtrt  which  multiply  to  f  ivt  an  tvtn  numbtr. 

®  Rtcall  that  th«  difftrtnet  batwitn  thit  optration  and  th«  latt  on*  it  mtrtly  in  th«  erdtt  of  composinf:  Fo(GoH)  vtrtut 
<FoG)oK 
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On  the  next  two  pages  is  a  graph  of  the  same  "best  run"  which  AM  executed.  The  nodes 
are  concepts,  and  the  links  are  actions  which  AM  performed.  Labels  on  the  links  indicate 
when  each  action  was  taken,  so  the  reader  may  observe  how  AM  jumped  around.  It  should 
also  easy  to  perceive  from  the  graph  which  paths  of  development  were  abandoned,  which 
concepts  ignored,  and  which  ones  concentrated  upon.  These  are  precisely  the  features  of 
AM’s  behavior  which  are  awkward  to  infer  from  a  simple  linear  trace  (as  in  the  previous 
section). 

In  more  detail,  here  is  how  to  read  the  graph:  Each  node  is  a  concept.  To  save  space,  these 
names  are  often  highly  abbreviated.  For  example,  "xO"  is  used  in  place  of  "TIMES-0". 

Each  concept  name  is  surrounded  by  from  zero  to  four  numbers: 

318  288 
FROBNATION 
310  291 

The  upper  right  number  indicates  the  task  number  (see  last  section)  during  which  examples 
of  this  concept  were  filled  In.  The  lower  right  number  tells  when  they  were  checked.  The 
upper  left  number  indicates  when  the  Domain/range  facet  of  that  concept  was  modified. 
Finally,  the  lower  left  number  is  the  task  number  during  which  some  new  Algorithms  for 
that  concept  were  obtained.  A  number  in  parentheses  indicates  that  the  task  with  that 
number  was  a  total  failure. 

Because  of  the  limited  space,  it  was  decided  that  If  a  concept  were  ever  renamed  by  the 
user,  then  only  that  newer,  mnemonic  name  would  be  given  in  the  diagram.  Thus  there  is 
an  arrow  from  "Coalesce"  to  "Square”,  an  operation  originally  called  "Self-Times"  by  AM. 

Sometimes,  a  concept  will  have  under  it  a  note  of  the  form  *GR0K.  This  simply  means  that 
AM  eventually  discovered  that  the  concept  was  equivalent  to  the  already-known  concept ' 
"Grok",  and  probably  forgot  about  this  one  (merged  it  into  the  one  it  already  Knew  about). 
The  "trail"  of  discovery  may  pick  up  again  at  that  pre-existing  concept.  A  node  written  as 
«GR0K  means  that  the  concept  was  really  the  same  as  "Grok",  but  AM  never  investigated  it 
enough  to  notice  this. 

Each  node  may  have  an  arrou’  leading  into  it,  and  any  number  of  arrows  emanating  from 
it.  The  arrows  Indicate  the  creation  of  new  concepts.  Thus  an  arrow  leading  to  concept 
"Frobnate"  indicates  how  that  concept  was  created.  An  arrow  directed  away  from  Frobnate 
points  to  a  concept  created  as,  e.g.,  a  specialization  or  an  example  of  Frobnate.  No 
arrowheads  are  in  practice  necessary:  all  arrows  are  directed  downwards. 

The  arrows  may  be  labelled,  indicating  precisely  what  they  represent  (e.g.,  composition, 
restriction)  and  what  the  task  number  was  when  they  occurred.  For  space  reasons,  the 
following  convention  has  proven  necessary:  if  an  arrow  emanating  from  C  is  un-numbered, 
it  is  assumed  to  have  occurred  at  the  same  time  as  the  arrow  to  its  immediate  left  which  also 
points  from  C;  if  all  the  arrows  emanating  from  C  have  no  number,  than  all  their  times  of 
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occurrence  are  assumed  to  be  the  lower  right^^  number  of  C.  Finally,  if  C  has  no  lower 
right  number,  the  arrow  is  assumed  to  have  the  value  of  the  upper  right  number  of  C. 

An  unlabelled  arrow  is  assumed  to  be  an  act  of  Specialization  or  the  creation  of  an 
Example.'^  Labels,  when  they  do  occur,  are  given  in  capitals  and  small  letters;  concept 
names  (nodes)  are  by  contrast  in  all  capitals. 

All  the  numbers  correspond  to  those  given  to  the  tasks  in  the  task-by-task  traces  presented 
in  the  last  section  (p.  115)  and  in  Appendix  5  (p.  294). 

The  first  part  of  this  graph  (presented  below)  contains  static  structural  (and  ultimately 
numerical)  concepts  which  were  studied  by  AM: 


STRUCTURES 


iEHPTY  .  153  15S 


The  rest  of  the  graph  (presented  on  the  next  page)  deals  with  activities  which  were 
investigated: 


*  ^  Thl«  i(  ofttn  tru«  bcctuit  iiitny  conctptf  art  crtatad  whilt  chtcking  axamplat  of  aoma  known  conctpt. 

1 2 

It  thoukf  b«  elaar  in  aach  eonlaxt  which  la  happaning.  If  not,  rafar  to  tha  abort  traca  in  tha  pracading  aaction,  and  look  up 
tha  appropriata  taak  numbar. 
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6.I.S.  AM  as  a  Computer  Program 

When  viewed  as  a  large  LISP  program,  there  is  very  little  of  interest  about  AM.  There  are 
the  usual  battery  of  customized  functions  (e.g.,  a  conditional  PRINT  function),  the  storage 
hacks  (special  emergency  garbage  collection  routines,  which  know  which  facets  are 
expendible),  the  time  hacks  (omniscicntly  arrange  clauses  in  a  conjunction  so  that  the  one 
most  likely  to  fail  will  come  first),  and  the  bugs  (if  the  user  renames  a  concept  while  it’s  the 
current  one  being  worked  on,  there  is  a  hf,  chance  of  AM  entering  an  infinite  loop). 

Below  are  listed  a  few  parameters  of  the  system,  although  I  doubt  that  they  hold  any 
theoretical  significance.  The  reader  may  be  curious  about  how  big  AM,  how  long  it  takes  to 
execute,  etc. 

Machine:  SUMEX,  PDP-10,  KMO  uniprocessor,  256k  core  memory. 

Language:  Interlisp,  January  ’75  release,  which  occupies  140k  of  the  total  256k,  but  which 
provides  a  surplus  "shadow  space"  of  256k  additional  words  available  for  holding  compiled 
code. 

AM  support  code:  200  compiled  (not  block-compiled)  utility  routines,  control  routines,  etc. 
They  occupy  roughly  lOOk,  but  all  are  pushed  into  the  shadow  space. 

AM  itself:  115  concepts,  each  occupying  about  .7k  (about  two  typed  pages,  when  Pretty- 
printed  with  indentation).  Facet/entries  stored  as  property/value  on  the  property  list  of 
atoms  whose  names  are  concepts’  names.'®  Each  concept  has  about  8  facets  filled  in. 

Heuristics  are  tacked  onto  the  facets  of  the  concepts.  The  more  general  the  concept,  the 
more  heuristic  rules  it  has  attached  to  it.'^  "Any-concept"  has  121  rules;  "Active  concept" 
has  24;  "Coalesce"  has  7;  "Set-lnseition"  has  none.  There  are  250  heuristic  rules  in  all, 
divided  into  4  flavors  (Fillin,  Check,  Suggest,  Interestingness).  Although  the  mean  number 
of  rules  is  therefore  only  about  2.2  (i.e.,  less  than  I  of  each  flavor)  per  concept,  the  standard 
deviation  of  this  is  a  whopping  127.4.  The  average  number  of  heuristics  (of  a  given  flavor) 
encountered  rippling  upward  from  a  randomly-chosen  concept  C  (along  the  network  of 
generalization  links)  is  about  35,  even  though  the  mean  path  length  is  only  about  4.'® 

The  total  number  of  jobs  executed  in  a  typical  run  (from  scratch)  is  about  200.  The  run 
ends  because  of  space  problems,  but  AM’s  performance  begins  to  degrade  near  the  end 
anyway. 

"Final"  state  of  AM:  300  concepts,  each  occupying  about  Ik.  Many  are  swapped  out  onto 
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If  tha  hauriatica  were  homojeneoualy  diatributed  among  tht  eoneepta,  tha  number  of  haurittiea  (of  a  given  type)  along  a 
typical  path  of  length  4  would  only  be  about  2,  not  35  If  all  tha  hauriatica  ware  tacked  onto  Anything  and 
Any-eoncapt,  tha  number  encountered  in  any  path  would  be  75,  not  35 
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disk.  Number  of  winning  concepts  discovered;  25  (estimated).  Number  of  acceptable 
concepts  defined:  100  (est.).'®  Number  of  losing  concepts  unfortunately  worked  on:  60  (est.). 
The  original  115  concepts  have  grown  to  an  average  size  of  2k.  Each  concept  has  about  1 1 
facets  filled  in. 

About  30  seconds  of  cpu  time  were  allocated  to  each  task,  on  the  average,  but  the  task 
typically  used  only  about  18  seconds  before  quitting.  Total  CPU  time  for  a  run  is  about  1 
hour.  Total  cpu  time  consumed  by  this  research  project  was  about  500  cpu  hours. 

Real  time:  about  1  minute  per  task,  2  hours  per  run.  The  idea  for  AM  was  formulated  in 
the  Fall  of  1974,  and  AM  was  coded  in  the  summer  of  1975.  Total  time  consumed  by  this 
project  to  date  has  been  about  2500  man-hours:  700  for  planning,  500  for  coding,  600  for 
modifying  and  debugging  and  experimenting,  and  700  for  writing  this  thesis. 


6.2.  Experiments  with  AM 

Now  we’ve  described  the  activities  AM  carried  out  during  its  best  run.  AM  was  working 
by  itself,  and  each  time  executed  the  top  task  on  the  agenda.  It  received  no  help  from  the 
user,  and  all  its  concepts’  Intuitions  facets  had  been  removed. 

One  valuable  aspect  of  AM  is  that  it  is  amenable  to  many  kind  of  interesting  experiments. 
Although  AM  is  too  ad  hoc  for  numerical  results  to  have  much  significance,  the  qualitative 
results  perhaps  do  have  some  valid  things  to  say  about  research  in  elementary  mathematics, 
about  automating  research,  and  at  least  about  the  efficacy  of  various  parts  of  AM’s  design. 

This  section  will  explain  what  it  means  to  perform  an  experiment  on  AM,  what  kinds  of 
experiments  are  imaginable,  which  of  those  are  feasible,  and  finally  will  describe  the  many 
experiments  which  were  performed  on  AM. 

By  modifying  AM  in  various  ways,  its  behavior  can  be  altered,  and  the  quality  of  its 
behavior  will  change  as  well.  As  a  drastic  example,  one  experiment  involved  forcing  AM 
to  select  the  next  task  to  work  on  randomly  from  the  agenda,  not  the  top  task  each  time. 
Needless  to  say,  the  performance  was  very  different  from  usual. 

By  careful  planning,  each  experiment  can  tell  us  something  new  about  AM:  how  valuable  a 
certain  piece  of  it  is,  how  robust  a  certain  scheme  really  is,  etc.  The  results  of  these 
experiments  would  then  have  something  to  contribute  to  a  discussion  of  the  "real 
intelligence"  of  AM  (e.g.,  what  features  were  superfluous),  and  contribute  to  the  design  of 
the  "next"  AM-like  system.  Generalizing  from  those  results,  one  might  suggest  some 
hypotheses  about  the  larger  task  of  automated  math  research. 

Let’s  cover  the  different  kinds  of  experiments  one  could  perform  on  AM: 

(i)  Remove  individual  concept  modules,  and/or  individual  heuristic  rules.  Then  examine 
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how  AM’s  performance  is  degraded.  AM  should  operate  even  if  most  of  its  heuristic  rules 
and  most  of  its  concept  modules  were  excised.  If  the  remaining  fragment  of  AM  is  too 
small,  however,  it  may  not  be  able  to  find  anything  interesting  to  do.  In  fact,  this  situation 
was  actually  encountered  experimentally,  when  the  first  few  partially  complete  concepts  were 
inserted.  If  only  a  little  bit  of  AM  is  removed,  the  remainder  will  in  fact  keep  operating 
without  this  "I'ninteresting  collapse".  The  converse  situation  should  also  hold:  although  still 
functional  with  any  concept  module  unplugged,  AM’s  performance  should  be  noticeably 
degraded.  That  is,  while  not  indispensable,  each  concept  should  nontrivially  help  the 
others.  The  same  holds  for  each  individual  heuristic  rule.  When  a  piece  of  AM  is 
removed,  which  concepts  does  AM  then  "miss"  discovering?  Is  the  removed 
concept/heuristic  later  discovered  anyway  by  those  which  are  left  in  AM?  This  should 
Indicate  the  importance  of  each  kind  of  concept  and  rule  which  AM  starts  with. 

(ii)  Vary  the  relative  weights  given  to  features  by  the  criteria  which  judge  aesthetics, 
interestingness,  worth,  utility,  etc.  See  how  important  each  factor  is  in  directing  AM  along 
successful  routes.  In  other  words,  vary  the  little  numbers  in  the  formulae  (both  the  global 
priority-assigning  formula  and  the  local  reason-rating  ones  inside  heuristic  rules).  One 
important  result  will  be  some  idea  of  the  robustness  or  "toughness"  of  the  numeric  weighting 
factors.  If  the  system  easily  collapses,  it  was  too  finely  tuned  to  begin  with. 

(iii)  Add  several  new  concept  modules  (including  new  heuristics  relevant  to  them)  and  see  if 
AM  can  work  in  some  unanticipated  field  of  mathematics  (like  graph  theory  or  calculus  or 
plane  geometry).  Do  earlier  achievements  -  concepts  and  conjectures  AM  synthesized 
already  -  have  any  impact  in  the  new  domain?  Are  some  specialized  heuristics  from  the 
first  domain  totally  wrong  here?  Do  all  the  old  general  heuristics  still  hold  here?  Are  they 
sufficient,  or  are  some  "general"  heuristics  needed  here  which  weren’t  needed  before?  Does 
AM  "slow  down"  as  more  and  more  concepts  get  Introduced? 

(iv)  Try  to  have  AM  develop  nonmathematical  theories  (like  elementary  physics,  ot 
program  verification).  This  might  require  limiting  AM’s  freedom  to  "Ignore  a  given  body 
of  data  and  move  on  to  something  more  interesting".  The  exploration  of  very  non- 
formalizable  fields  (e.g.,  politics)  might  require  much  more  than  a  small  augmentation  of 
AM’s  base  of  concepts.  For  some  such  domains,  the  "Intuitions"  scheme,  which  had  to  be 
abandoned  for  math,  might  prove  valid  and  valuable. 

(v)  Add  several  new  concepts  dealing  with  proof,  and  of  course  add  all  the  associated 
heuristic  rules.  Such  rules  would  advise  AM  on  the  fine  points  of  using  various  techniques 
of  proof/disproof:  when  to  use  them,  what  to  try  next  based  on  why  the  last  attempt  failed, 
etc.  See  if  the  hinds  of  discoveries  AM  makes  are  increased. 

Just  prior  to  the  writing  of  this  document,  several  experiments  (of  types  i,  ii,  and  iii 
above '^)  were  set  up  and  performed  on  AM.  We’re  now  ready  to  examine  each  of  them  in 
detail.  The  following  points  are  covered  for  each  experiment: 

1.  How  was  it  thought  of? 

2.  What  will  be  gained  by  it?  What  would  be  the  implications  of  the  various  possible 

outcomes? 
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3.  How  was  the  experiment  set  up?  What  preparations/modifications  had  to  be  made? 

How  much  time  (man-hours)  did  it  take? 

4.  What  happened?  How  did  AM’s  behavior  change?  Was  this  expected?  Analysis. 

5.  What  was  learned  from  this  experiment?  Can  we  conclude  anything  which  suggests 

new  experiments  (e.g.,  use  a  better  machine,  a  new  domain)  or  which  bears  on  a 
more  general  problem  that  AM  faced  (e.g.,  a  new  way  to  teach  math,  a  new  idea 
about  doing  math  research)? 


6.2.1.  Must  the  Worth  numbers  be  finely  tuned? 

Each  of  the  115  initial  concepts  has  supplied  to  it  (by  the  author)  a  number  between  0  and 
1000,  stored  as  its  Worth  facet,  which  is  supposed  to  represent  the  overall  value  of  the 
concept.  "Compose"  has  a  higher  initial  Worth  than  "Structure-delete",  which  is  higher 
than  "Equality"'*. 

Frequently,  the  priority  of  a  task  involving  C  depends  on  the  overall  Worth  of  C.  How 
sensitive  is  AM’s  behavior  to  the  initial  settings  of  the  Worth  facets?  How  finely  tuned 
must  these  initial  Worth  values  be? 

This  experiment  was  thought  of  because  of  the  'brittleness’  of  many  other  A I  systems,  the 
amount  of  fine  tuning  needed  to  elicit  coherent  behavior.  For  example,  see  the  discussion  of 
limitations  of  PUPS,  in  [Lenat  76b].  The  author  believed  that  AM  was  very  resilient  in 
this  regard,  and  that  a  demonstration  of  that  fact  would  increase  credibility  in  the  power  of 
the  ideas  which  AM  embodies. 

To  test  this,  a  simple  experiment  was  performed.  Just  before  starting  AM,  the  mean  value 
of  all  concepts’  Worth  values  was  computed.  It  turned  out  to  be  roughly  200.  Then  each 
concept  had  its  Worth  reset  to  the  value  200.'®  This  was  done  "by  hand’’,  by  the  author,  in 
a  matter  of  seconds.  AM  was  then  started  and  run  as  If  there  were  nothing  amiss,  and  its 
behavior  was  watched  carefully. 

What  happened?  By  and  large,  the  same  major  discoveries  were  made  -  and  miij-  ’  -  as 
usual,  in  the  same  order  as  usual.  But  whereas  AM  proceeded  fairly  smoothly  before,  with 
little  superfluous  activity,  it  now  wandered  quite  blindly  for  long  periods  of  time,  especially 
at  the  very  beginning.  Once  AM  "hooked  into"  a  line  of  productive  development,  it 
followed  it  Just  as  always,  with  no  noticeable  additional  wanderings.  As  one  of  these  lines 
of  developments  died  out,  AM  would  wander  around  again,  until  the  next  one  was  begun. 

It  took  roughly  three  times  as  long  for  each  major  discovery  to  occur  as  normal.  This 
"delay"  got  shorter  and  shorter  as  AM  developed  further.  In  each  case,  the  tasks  preceding 
the  discovery  and  following  it  were  pretty  much  the  same  as  normal;  only  the  tasks 
"between"  two  periods  of  development  were  different  -  and  much  more  numerous.  The 
precise  numbers  Involved  would  probably  be  more  misleading  than  helpful,  so  they  will  not 


I  ft 

A*  AM  prorratttt,  H  nolkat  tomtlhinf  inlirttlinf  about  Equality  tvary  now  and  than,  and  puahat  ita  Worth  valua 
upwards. 
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Tha  initial  aprtad  of  valuat  was  from  100  to  600. 
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be  given^°. 

The  reader  may  be  interested  to  learn  that  the  Worth  values  of  many  of  the  concepts  -  and 
most  of  the  new  concepts  -  ended  up  very  close  to  the  same  values  that  they  achieved  in 
the  original  run.  Overrated  concepts  were  investigated  and  proved  boring;  underrated 
concepts  had  to  wait  longer  for  their  chances,  but  then  quickly  proved  interesting  and  had 
their  Worth  facets  boost.'d. 

The  conclusion  I  draw  from  this  change  in  behavior  is  that  the  Worth  facets  are  useful  for 
making  blind  decisions  -  where  AM  must  choose  based  only  on  the  overall  worths  of  the 
various  concepts  in  its  repertoire.  Whenever  a  specific  reason  existed,  it  was  far  more 
influential  than  the  "erroneous”  Worth  values.  The  close,  blind,  nandom  decisions  occur 
between  long  bursts  of  specific-reason-dnven  periods  of  creative  work.^' 

The  general  answer,  then,  is  No,  the  initial  settings  of  the  Worth  values  are  not  crucial. 
Guessing  reasonable  initial  values  for  them  is  merely  a  time-saving  device.  This  suggests 
an  interesting  research  problem:  what  impact  does  the  quality  of  initial  starting  values  have 
on  humans?  Give  several  bright  undergraduate  math  majors  the  same  set  of  objects  and 
operators  to  play  with,  but  tell  some  of  them  (i)  nothing,  and  some  of  them  (ii)  a  certain  few 
pieces  of  the  system  are  very  promising,  (lii)  emphasize  a  different  subset  of  the  objects  and 
operators.  How  does  "misinformation"  impede  the  humans?  How  about  no  information? 
Have  them  give  verbal  protocols  about  where  they  are  focussing  their  attention,  and  why. 

Albeit  at  a  nontrivial  cost,  the  Worth  facets  did  manage  to  correct  themselves  by  the  end  of 
a  long^^  run.  What  would  happen  if  the  Worth  facets  of  those  1 15  concepts  were  not  only 
initialized  to  200,  but  were  held  fixed  at  200  for  the  duration  of  the  run? 

In  this  case,  the  delay  still  subsided  with  time.  That  is,  AM  still  got  more  and  more  "back 
to  normal"  as  it  progressed  onward.  The  reason  is  because  AM’s  later  work  dealt  with 
concepts  like  Primes,  Square-root,  etc.,  which  were  so  far  removed  from  the  initial  base  of 
concepts  that  the  initial  concepts’  Worths  wsrc  of  little  consequence. 

Even  more  drastically,  we  could  force  all  the  Worth  facets  of  all  concepts  -  even  newly- 
created  ones  -  to  be  kept  at  the  value  200  forever.  In  this  case,  AM’s  behavior  doesn’t 
completely  disintegrate,  but  that  delay  factor  actually  increases  with  time:  apparently,  AM 
begins  to  suffer  from  the  exponential  growth  of  "things  to  do"  as  its  repertoire  of  concepts 
grows  linearly.  Its  purposiveness,  its  directionality  depends  on  "focus  of  attention"  more  and 
more,  and  if  that  feature  is  removed,  AM  loses  much  of  its  rationality.  A  factor  of  5  delay 
doesn’t  sound  that  bad  "efficiency-wise",  but  the  actual  apparent  behavior  of  AM  is  as 
staccato  bursts  of  development,  followed  by  wild  leaps  to  unrelated  concepts.  AM  no  longer 
can  "permanently"  record  its  interest  in  a  certain  concept. 

So  we  conclude  that  the  Worth  facets  are  (i)  not  finely  tuned,  yet  (ii)  provide  important 


20 

Any  r®*d®r  who  wuhes  to  perform  this  experiment  cen  «tmply  say  [MAPC  CONCEPTS  '(LAMBDA  (r.)  (SETB  c  WORTH 
200]  to  Inter'isp,  before  typing  (START)  io  begin  AM 
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Irvcidentally,  GPS  behaved  this  same  way  See,  eg,  [Neweti&Simon  72] 
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Chapter  6 


AM:  Oiacovery  in  Mathematic*  a*  Heuri*tic  Search 


-129- 


globat  information  about  the  reh  tve  values  of  concepts.  If  the  Worth  facets  are  completely 
disabled,  the  rationality  of  AM’s  behavior  hangs  on  the  slender  thread  of  "focus  of 
attention". 


6.2.2.  How  finely  tuned  is  the  Agenda? 

The  top  few  candidates  on  the  agenda  always  appear  to  be  reasonable  (to  me).  If  I  work 
with  the  system,  guiding  it,  I  can  cause  it  to  make  a  few  discoveries  it  wouldn’t  otherwise 
make,  and  I  can  cause  it  to  make  its  typical  ones  much  faster  (about  a  factor  of  2).  Thus  the 
very  top  task  is  not  always  the  best. 

If  AM  randomly  selects  one  of  the  top  20  or  so  tasks  on  the  agenda  each  time,  what  will 
happen  to  its  behavior?  Will  it  disintegrate,  slow  down  by  a  factor  of  10,  slow  down 
slightly,...? 

This  experiment  required  only  a  few  seconds  to  set  up,  but  demanded  a  familiarity  with  the 
LISP  functions  which  make  up  AM’s  control  structure.  At  a  certain  point,  AM  asks  for 
Best-task(Agenda).  Typically,  the  LISP  function  Best-task  is  defined  as  CAR  -  i.e.,  pick  the 
first  member  from  the  list  of  tasks.  What  I  did  was  to  redefine  Best-task  as  a  function 
which  randomly  selected  n  from  the  set  {1,2,..., 20},  and  then  returned  the  n*'’  member  of  the 
job-list. 

If  you  watch  the  top  Job  on  the  agenda,  it  will  take  about  10  cycles  until  AM  chooses  it. 
And  yet  there  are  many  good,  interesting,  worthwhile  jobs  sprinkled  among  the  top  20  on 
the  agenda,  so  AM’s  performance  is  cut  by  merely  a  factor  of  3,  as  far  as  cpu  time  per  given 
major  discovery.  Part  of  this  better-than-20  behavior  is  due  to  the  fact  that  the  18*^  best 
task  had  a  much  lower  priority  rating  than  the  top  few,  hence  was  allocated  much  less  cpu 
time  for  its  quantum  than  the  top  task  would  have  received.  Whether  it  succeeded  or 
failed,  it  used  up  very  little  time.  Since  AM  was  frequently  working  on  a  low-value  task,  it 
was  unwilling  to  spend  much  time  or  space  on  it.  So  the  mean  time  allotted  per  task  fell  to 
about  15  seconds  (from  the  typical  30  secs).  Thus,  the  "losers"  were  dealt  with  quickly,  so  the 
detriment  to  cpu-time  performance  was  softened. 

Yet  AM  is  much  less  rational  in  its  sequencing  of  tasks.  A  topic  will  be  dropped  right  in  the 
middle,  for  a  dozen  cycles,  then  picked  up  again.  Often  a  "good"  task  will  be  chosen, 
having  reasons  all  of  which  were  true  10  cycles  ago  -  and  which  are  clearly  superior  to 
those  of  the  last  10  tasks.  This  is  what  is  so  annoying  to  human  onlookers. 

To  carry  this  investigation  further,  another  experiment  was  carried  out.  AM  was  forced  to 
alternate  between  choosing  the  top  task  on  the  agenda,  and  a  randomly-chosen  one. 
Although  its  rate  of  discovery  was  cut  by  less  than  half,  its  behavior  was  almost  as 
distasteful  to  the  user  as  in  the  last  (always-random)  experiment. 

Conclusion:  Picking  (on  the  average)  the  lOth-best  candidate  impedes  progress  by  a  factor 
less  than  10  (about  a  factor  of  3),  but  it  dramatically  degrades  the  "sensibleness"  of  AM’s 
behavior,  the  continuity  of  its  actions.  Humans  place  a  big  value  on  absolute  sensibleness, 
and  believe  that  doing  something  silly  507,  of  the  time  is  much  worse  than  half  as 
productive  as  always  doing  the  next  most  logical  task. 
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CoroHary:  Having  20  multi-processors  simultaneously  execute  the  top  20  jobs  will  increase 
the  rate  of  "big"  discoveries,  but  not  by  a  full  factor  of  20. 

Another  experiment  in  this  same  vein  was  done,  one  which  was  designed  to  be  far  more 
crippling  to  AM.  Be-threshhold  was  held  at  0  always,  so  any  task  which  ever  got  proposed 
was  kept  forever  on  the  agenda,  no  matter  how  low  its  priority.  The  Best-task  function  wa.' 
modified  so  it  randomly  selected  any  member  of  the  list  of  jobs.  As  a  final  insult,  the  Worth 
facets  of  all  the  concepts  were  initialized  to  200  before  starting  AM. 

Result:  Many  "explosive"  tasks  were  chosen,  and  the  number  of  new  concepts  increased 
rapidly.  As  expected,  most  of  these  were  real  "losers".  There  seemed  no  rationality  to  AM’s 
sequence  of  actions,  and  it  was  quite  boring  to  watch  it  floundering  so.  The  typical  length 
of  the  agenda  was  about  500,  and  AM’s  performance  was  "slowed"  by  at  least  a  couple 
orders  of  magnitude.  A  more  subjective  measure  of  its  "intelligence"  would  say  that  it 
totally  collapsed  under  this  random  scheme. 

Conclusion;  Having  an  unlimited  number  of  processors  simultaneously  execute  all  the  jobs 
on  the  agenda  would  increase  the  rate  at  which  AM  made  big  discoveries,  at  an  ever 
accelerating  pace  (since  the  length  of  the  agenda  would  grow  exponentially). 

Having  a  uniprocessor  simulate  such  parallel  processing  would  be  a  losing  idea,  however. 
The  truly  "intelligent"  behavior  AM  exhibits  is  fts  plausible  sequencing  of  tasks. 


6.2.3.  How  valuable  is  tacking  reasons  onto  each  task? 

Let’s  dig  inside  the  agenda  scheme  now.  One  idea  I’ve  repeatedly  emphasized  is  the 
attaching  of  reasons  to  the  tasks  on  the  agenda,  and  using  those  reasons  and  their  ratings  to 
compute  the  overall  priority  value  assigned  to  each  task.  An  experiment  was  done  to 
ascertain  the  amount  of  intelligence  that  was  emanating  from  that  idea. 

The  global  formula  assigning  a  priority  value  to  each  job  was  modified.  We  let  it  still  be  a 
function  of  the  reasons  for  the  job,  but  we  "trivialized"  it:  the  priority  of  a  job  was 
computed  as  simply  the  number  of  reasons  it  has  (normalized  by  multiplying  by  100,  and 
cut-off  if  over  1000). 

This  raised  the  new  question  of  what  to  do  if  several  jobs  all  have  the  same  priority.  In 
that  case,  I  had  AM  execute  them  in  stack-order  (most  recent  first)^®. 

Result:  I  secretly  expected  that  this  wouldn’t  make  too  much  difference  on  AM’s  apparent 
level  of  directionality,  but  such  was  definitely  not  the  case.  While  AM  opened  by  doing 
tasks  which  were  far  more  interesting  and  daring  than  usual  (e.g.,  filling  in  various 
Coalescings  right  away),  it  soon  became  obvious  that  AM  was  being  swayed  by  hitherto 
trivial  coding  decisions.  Whole  classes  of  tasks  -  like  Checking  Examples  of  C  -  were 
never  chosen,  because  they  only  had  one  or  two  reasons  supporting  them.  Previously,  one 
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or  two  good  reasons  were  sufficient.  Now,  tasks  with  several  poor  reasons  were  rising  to  the 
top  and  being  worked  on.  Even  the  LIFO  (stack)  policy  for  resolving  ties  didn’t  keep  AM’s 
attention  focussed. 

Conclusion;  Unless  a  conscious  effort  is  made  to  ensure  that  each  reason  really  will  carry 
roughly  an  eq'  al  amount  of  semantic  impact  (charge,  weight),  it  is  not  acceptable  merely  to 
choose  tasks  on  the  basis  of  how  many  reasons  they  possess.  Even  in  those  constricted 
equal-weight  cases,  the  similarities  between  reasons  supporting  a  task  should  be  taken  into 
account. 

Another  experiment,  not  yet  performed,  will  pin  down  the  value  of  this  rule-attaching  i'^^a 
even  more  precisely.  A  threshhold  value  -  say  400  -  will  be  fixed.  Any  reason  whose  rating 
is  above  that  threshhold  will  be  called  a  good  reason,  and  every  other  reason  will  be  a 
minor  reason.  Then  tasks  will  be  ordered  by  the  number  of  good  reasons  they  possess,  and 
ties  will  be  broken  by  the  number  of  minor  reasons.  Still  another  experiment  would  be  to 
randomly  pick  any  task  with  at  least  one  good  reason. 


6.2.4.  What  if  certain  concepts  are  eliminated/added? 

Feeling  in  a  perverse  mood  one  day,  I  eliminated  the  concept  "Equality”  from  AM,  to  see 
what  it  would  then  do.  Equality  was  a  key  concept,  because  AM  discovered  Numbers  via 
the  technique  of  generalizing  the  relation  "Equality"  (exact  equality  of  2  given  structures,  at 
all  internal  levels).  What  would  happen  if  we  eliminate  this  path?  Will  AM  rederive 
Equality?  Will  it  get  to  Cardinality  via  another  route?  Will  it  do  some  set-theoretic  things? 

Result:  Rather  disappointing.  AM  never  did  re-derive  Equality,  nor  Cardinality.  It  spent  its 
time  thrashing  about  with  various  flavors  of  data-structures  (unordered  vs.  ordered, 
multiple-elements  allowed  or  not,  etc.),  deriving  large  quantities  of  boring  results  about 
them.  Very  many  composings  and  coalescings  were  done,  but  no  exciting  new  operations 
were  produced. 

It  is  expected  that  eliminating  other,  less  central  concepts  than  Equality  will  do  less  damage 
to  AM’s  progress.  The  reader  is  invited  to  try  such  experiments  himself. 

To  eliminate  a  concept,  like  equality,  one  need  merely  type  KILB{OBJ -EQU AUTY^^)  at  the 
beginning  of  the  session,  before  typing  (START). 

An  even  kinder  type  of  experiment  would  be  to  add  a  few  concepts.  One  such  experiment 
was  done:  the  addition  of  Cartesian-product.  This  operation,  named  C-PROD,  accepts  two 
sets  as  arguments  and  returns  a  third  set  as  its  value:  the  Cartesian  product  of  the  first  two. 

Result:  The  only  significant  change  in  AM’s  behavior  was  that  TIMES  was  discovered  first 
as  the  restriction  of  C-PROD  to  Canonical-Bags.  When  it  soon  was  rediscovered  in  a  few 
other  guises,  its  Worth  was  even  higher  than  usual.  AM  spent  even  more  time  exploring 
concepts  concerned  with  it,  and  deviated  much  less  for  quite  a  long  time. 
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Synthesis  of  the  above  experiments:  It  appears  that  AM  may  really  be  more  specialized  than 
expected;  AM  may  only  be  able  to  forge  ahead  along  one  or  two  main  lines  of  development 
-  at  least  if  we  demand  it  make  very  interesting,  well-known  discoveries  quite  frequently. 
Removing  certain  key  concepts  can  be  disastrous.  On  the  other  hand,  adding  some 
carefully-chosen  new  ones  can  greatly  enhance  AM's  directionality  (hence  its  apparent 
intelligence). 

Conclusion:  In  its  current  state,  AM  is  thus  seen  to  be  minimally  competent  if  any 
knowledge  is  removed,  it  appears  much  less  intelligent;  if  any  is  added,  it  appears  slightly 
smarter. 

Suggestion  for  future  research:  A  hypothe'*-  which  should  be  test>.u  experimentally,  is  that 
the  importance  of  the  presence  of  each  indiviuual  concept  decreases  as  the  number  of  -  and 
deptA  of  -  the  synthesized  concepts  increase.  That  is,  any  excision  would  eventually  "heal 
over",  given  enough  time.  The  failure  of  AM  to  verify  this  may  be  due  to  the  relatively 
small  amount  of  development  in  toto  (an  hour  of  epu  time,  a  couple  hundred  new  concepts, 
a  few  levels  deeper  than  the  starting  ones). 


6.2.5.  What  if  certain  heuristics  are  tampered  with? 

The  class  of  experiments  described  by  this  section’s  heading  should  prove  entertaining,  but 
it  will  probably  be  difficult  to  learn  from  their  results. 

Why  is  this?  Some  of  the  heuristics  were  added  to  correct  a  specific  problem;  removing 
them  would  simply  re-initiate  that  problem.  Others  were  never  actually  used  by  AM,  so 
their  deletion  would  have  no  effect.  If  AM  enlarged  the  range  of  what  it  worked  on,  their 
absence  might  then  be  felt. 

What  good  would  these  experiments  be,  then?  We  might  learn  something  about  the 
"redundancy  of  reasoning  chains".  We’d  stop  AM  just  before  it  made  a  big  discovery, 
remove  the  heuristic  rule  it  was  about  to  use,  and  sec  if  it  ever  makes  that  big  discovery 
anyway,  later  on.  If  not,  perhaps  the  discarded  rule  was  very  important,  or  there  are 
alternate  rules  which  e.xist  but  haven’t  been  inserted  in  AM.  If  the  same  discovery  is  made 
by  an  alternate  route,  does  that  indicate  an  unexpected  duplication  of  heuristic  knowledge? 
if  heuristic  H2  is  used  now,  instead  of  HI,  does  that  suggest  a  new  meta-rule:  "if  you  want 
to  apply  one  of  H 1/H2  but  can’t,  see  if  the  other  rule  can  be  applied."?  Is  that  last  sentence 
really  a  Meta-meta-rule? 

Before  this  discussion  enters  an  infinite  loop.  I’d  better  extract  myself  -  and  the  reader  -  by 
commenting  that  there  may  be  an  idea  in  all  this,  perhaps  of  use  to  whoever  writes  Meta- 
AM.  It  was  decided  not  to  carry  out  a  systematic  series  of  experiments  of  this  type  until 
AM  is  much  further  developed  in  abilities. 


Chapttr  6 


AM:  Discoviry  in  Mi'htmilict  at  Hcurittic  Search 


-133- 


6.2.6.  Can  AM  work  in  a  new  domain:  Plane  Geometry? 


A  true  strategy  should  be  domain-independent. 

—  Adams 


As  McDermott  points  out  [McDermott  76],  just  labelling  a  bunch  of  heuristics  ‘Operation 
heuristics’  doesn't  suddenly  make  them  relevant  to  any  operation;  all  it  does  is  give  that 
impression  to  a  human  who  looks  at  the  code  (or  a  description  of  it).  Since  the  author 
hoped  that  the  labelling  really  was  fair,  an  experiment  was  done  to  test  this.  Such  an 
experiment  would  be  a  key  determiner  of  how  general  AM  is. 

How  might  one  demonstrate  that  the  "Operation"  heuristics  really  could  be  useful  or  dealing 
with  any  operation,  not  Just  the  ones  already  in  AM’s  initial  base  of  concepts? 

One  way  would  be  to  pick  a  new  domain,  and  see  how  many  old  heuristics  contribute  to  — 
and  how  many  new  heuristics  have  to  be  added  to  elicit  -  some  sophisticated  behavior  in 
that  domain.  Of  course,  some  new  primitive  concepts  would  have  to  be  introduced  (defined) 
to  AM. 

Only  one  experiment  of  this  type  was  attempted.  The  author  added  a  new  base  of  concepts 
to  the  ones  already  in  AM.  Included  were:  Point,  Line.  Angle,  Triangle,  Equality  of 
points/lines/angles/triangles.  The.se  simple  pldtie  geometry  notions  were  sufficiently  removed 
from  set-thecrctk  ones  that  those  pre-existing  specific  concepts  would  be  totally  irrelevant; 
on  the  other  hand,  the  general  concepts  -  the  ones  with  the  heuristics  attached  -  would  still 
be  just  as  relevant;  Any-concept,  Operation,  Predicate,  Structure,  etc. 

For  each  new  geometric  concept,  the  only  facet  filled  in  was  its  Definition.  For  the  new 
predicates  and  operators,  their  Domain/range  entries  were  also  supplied.  No  new  heuristics 
were  added  to  AM. 

Results:  fairly  good  behavior.  AM  was  able  to  find  examples  of  all  the  concepts  defined, 
?nd  to  use  the  character  of  the  results  of  those  examples  searches  to  determine  intelligent 
courses  of  action.  AM  derived  congruence  and  similarity  of  triangles,  and  several  other 
well-known  simple  concepts.  An  unusual  result  was  the  repeated  derivation  of  the  idea  of 
"timberline".  This  is  a  predicate  on  two  triangles:  Timberline(Tl,T2)  iff  T1  and  T2  have  a 
common  angle,  and  the  side  opposite  that  angle  in  the  two  triangles  are  parallel: 
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A 


Timberline(ABC,ADE) 


Since  AM  kept  rederiving  this  in  new  ways,  it  seem?  surprising  that  thete  is  no  very 
common  name  for  the  concept.  It  could  be  that  AM  is  using  techniques  which  humans  don’t 
-  at  least,  for  geometry. 

The  only  new  bit  of  knowledge  that  came  out  of  this  experiment  was  a  "u-sp”  for  Coldbach’s 
conjecture:  any  angle  (0-180  degrees)  can  be  built  up  (to  within  I  degree)  as  the  sum  of  two 
angles  of  prime  degrees  (<I80).  This  result  is  admittedly  esoteric  at  best,  but  is  nonetheless 
woith  reporting. 

The  total  effort  expended  on  this  experiment  was;  a  few  months  of  subconscious  processing, 
ten  hours  of  designing  the  base  of  concepts  to  insert,  ten  hours  inserting  and  debugging 
them.  The  whole  task  took  about  two  days  of  real  time. 

The  conclusion  to  be  drawn  is  that  heuristics  really  can  be  generally  useful;  their  attachment 
to  general-sounding  concepts  is  not  an  illusion.^®  The  implication  of  this  is  that  AM  can  be 
grown  incrementally,  domain  by  domain.  Adding  expertise  in  a  new  domain  lequires  only 
the  introduction  of  concepts  local  to  that  domain;  ail  the  very  general  concepts  -  and  their 
heuristics  -  already  exist  and  can  be  used  with  no  change. 

The  author  feels  that  this  result  can  be  generalized:  AM  can  be  expanded  in  scope,  even  to 
non-mathematical  fields  of  endeavor.  In  each  field,  however,  the  rankings  of  the  various 
heuristics^®  may  shift  slightly.  As  the  domain  gets  further  away  from  mathematics,  various 
heuristics  are  important  which  were  ignorable  before  (e.g.,  those  dealing  with  ethics),  and 
some  pure  math  research-oriented  heuristics  become  less  applicable  ("giving  up  and  moving 
on  to  another  topic"  is  not  an  acceptable  response  to  the  15-puzzle,  nor  to  a  hostage 
situation). 

Well,  it  sounds  as  if  we’ve  shifted  our  orientation  from  ‘Results’  to  a  subjective  evaluation 
of  those  results.  Let’s  start  a  new  chapter  to  legitimize  this  type  of  commentary. 

25 

Or  it'*  *  v*ry  food  ilkition!  But  note  if  Ihl*  ph*nom«non  it  r*p*at*bl«  *nd  ut*ful,  th*n  (lilt*  N*wtoni*n  mtchinict)  it 
won't  pr«gm«tic*lly  m*tt*r  wh«th*r  it'*  only  *n  illution. 

26 

th«  numtrie  viluo*  that  thoukf  b«  r*turn«d  by  tho  local  rating  a  formula*  which  ir*  atlachod  to  th«  hauriatic  rul** 


This  chapter  contains  discussions  "meta"  to  AM  itself. 

First  comes  an  essay  about  judging  the  performance  of  a  system  like  AM.  This  is  a  very 
hard  task,  since  AM  has  no  "goal".  Even  using  current  mathematical  standards,  should  AM 
be  judged  on  what  it  produced,  or  the  quality  of  the  path  which  led  to  those  results,  or  the 
difference  between  what  it  started  with  and  what  it  finally  derived? 

Section  7.2  then  deals  with  the  capabilities  and  limitations  of  AM: 

•  What  concepts  can  be  elicited  from  AM  now?  With  a  little  tuning/tiny  additions? 

•  What  are  some  notable  omissions  in  AM’s  behavior?  Can  the  user  elicit  these? 

•  What  could  probably  be  done  within  a  couple  months  of  modifications? 

•  Aside  from  a  total  change  of  domain,  what  kinds  of  activities  does  AM  lack  (e.g., 

proof  capabilities)?  Are  any  discoveries  (e.g.,  analytic  function  theory)  clearly 
beyond  its  design  limitations? 

Finally,  all  the  conclusions  will  be  gathered  together,  and  a  short  summary  of  this  project’s 
'contribution  to  knowledge’  will  be  tolerated. 


7.1.  .Tudging  Performance 

One  may  view  AM’s  activity  as  a  progression  from  an  initial  core  of  knowledge  to  a  more 
sophisticated  "final"'  body  of  concepts  and  their  facets.  Then  each  of  the  following  is  a 
reasonable  way  to  measure  success,  to  "Judge"  AM: 

1.  By  AM’s  ultimate  achievements.  Examine  the  list  of  concepts  and  methods  AM 


At  htt  b«tn  tlrttttd  btfort,  AM  hit  no  fixtd  gotl,  no  "finil"  tlalt  For  prtclicti  purpottt,  howivir,  tht  lottlity  of 
txploraliont  by  AM  it  about  tht  tame  at  tht  "best  run  to  far";  tithtr  of  thttt  can  bt  thought  of  at 
defining  what  it  meant  by  the  "fintl'  atate  of  knowladge. 
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developed.  Did  AM  ever  discover  anything  interesting  yet  unknown  to  the  user?‘ 
Anything  new  to  Mankind? 

2.  By  the  character  of  the  difference  between  the  initial  and  final  states.  Progressing 

from  set  theory  to  number  theory  Is  much  more  impressive  than  progressing  from 
two-dimensional  geometry  to  three-dimensional  geometry. 

3.  By  the  quality  of  the  route  AM  took  to  accomplish  these  advances:  How  clever,  how 

circuitous,  how  many  of  the  detours  were  quickly  identified  as  such  and 
abandoned? 

4.  By  the  character  of  the  User-System  interactions:  How  important  is  the  user’s 
guidance?  How  closely  must  he  guide  AM?  What  happens  If  he  doesn’t  say 
anything  ever?  When  he  does  want  to  say  something,  is  there  an  easy  way  to 
express  that  to  AM,  and  does  AM  respond  well  to  it?  Given  a  reasonable  kick  in 
the  right  direction,  can  AM  develop  the  mini-theories  which  the  user  intended,  or 
at  least  something  equally  interesting? 

5.  By  its  intuitive  heuristic  powers:  Does  AM  believe  in  "reasonable"  conjectures?  How 

accurately  does  AM  estimate  the  difficulty  of  tasks  it  is  considering?  Does  AM  tie 
together  (e.g.,  as  analogous)  concepts  which  are  formally  unrelated  yet  which  benefit 
from  such  a  tie? 

6.  By  the  results  of  the  experiments  described  in  Section  6.2  (beginning  on  page  125). 

How  "tuned"  is  the  worth  numbering  scheme?  The  task  priority  rating  scheme? 
How  fragile  is  the  set  of  initial  concepts  and  heuristic  rules?  How  domain-specific 
are  those  heuristics  really?  The  set  of  facets? 

7.  By  the  very  fact  that  the  kinds  of  experiments  outlined  in  Section  6.2  can  easily  be 

"set  up"  and  performed  on  AM.  Regardless  of  the  experiments’  outcomes,  the 
features  of  AM  which  allow  them  to  be  carried  out  at  all  are  worthy  of  note. 

8.  By  the  Implications  of  this  project.  What  can  AM  suggest  about  educating  young 

mathematicians  (and  scientists  in  general)?  What  can  AM  say  about  doing  math? 
about  empirical  research  in  general? 

9.  By  the  number  of  new  avenues  for  research  and  experimentation  it  opens  up.  What 

new  projects  can  we  propose? 

10.  By  comparisons  to  other,  similar  systems. 

For  each  of  these  10  measuring  criteria,  a  subsection  will  now  be  provided,  to  illustrate  (i) 
the  biggest  achievement  and  (li)  the  biggest  failure  of  AM  along  each  dimension,  and  (iii) 
to  try  to  objectively  characterize  AM’s  performance  according  to  that  measure.  Other 
measures  of  Judging  performance  cxist^  of  course,  but  haven’t  been  applied  to  AM. 


7.1.1.  AM’s  Ultimate  Discoveries 


2 

Tlw  *uttr*  i«  *  human  who  worka  with  AM  intaractivtly,  (ivin(  it  hmla,  commandt,  quaationa,  ate  Notico  that  by  "now"  wo 
mean  now  to  the  uaor,  not  now  to  Mankind  Thio  micht  occur  if  tho  uaer  wero  a  child,  and  AM  diocovored 
tome  otomontary  facta  of  arithmotic.  Thit  it  not  roaliy  ao  provincial-  mathomaticiana  taka  "now"  to  mean  now 
to  Mankind,  not  now  in  tho  Univorao  I  fool  philoaophy  alippinf  in,  oo  thia  footnota  ia  tarminatad 

For  oxampio,  Colby  oont  tranacripta  of  a  aoaaion  with  PARRY  to  varioua  paychiatriata,  and  had  thorn  ovaluato  oach 
intoraction  along  aovoral  dimonaiona  Tho  oamo  kind  of  aurvoy  could  bo  dona  for  AM.  A  quito  ooparato 
moaauro  of  AM  would  bo  to  wait  and  ooo  how  many  futu'o  articloa  in  tho  fiold  rofor  to  thio  work  (and  in 
what  lifhti). 
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Two  of  the  ideas  which  AM  proposed  were  totally  new  and  unexpected:^ 

1.  Consider  numbers  with  an  abnormally  high  number  of  divisors.  If  d(n)  represents 

the  number  of  divisors  of  n,®  then  AM  defines  the  set  of  "maximally-divisible 
numbers"  to  be  {n<N  |  (Vm<n)  d(m)<d{n)}.  By  factoring  each  such  number  into 
primes,  AM  noticed  a  regularity  in  them.  The  author  then  developed  a  "mini¬ 
theory"  about  these  numbers.  It  later  turned  out  that  Ramanujan  had  already 
proposed  that  very  same  definition  (in  1915),  and  had  found  that  same  regularity. 
His  results  only  partially  overlap  those  of  AM  and  the  author,  however,  and  his 
methods  are  radically  different. 

2.  AM  found  a  cute  geometric  application  of  Goldbach's  conjecture.  Given  a  set  of  all 

angles  of  prime  degree,  from  0  to  180°,®  then  any  angle  between  0  and  180  degrees 

can  be  approximated  to  within  1°  by  adding  a  pair  of  angles  from  this  prime  set. 
In  fact,  it  is  hard  to  find  smaller  sets  than  this  one  which  approximate  any  angle  to 
that  accuracy. 

By  and  large,  the  other  concepts  which  AM  developed  were  either  already-known,  or  real 
losers.  For  example,  AM  composed  Set-insert  with  the  predicate  Equality.  The  result  was 
an  operation  InsertoEqual(x,y,z),  which  first  tested  whether  x  was  Equal  to  y  or  not.  The 
value  of  this  was  either  True  or  False^  Next,  this  T/F  value  was  inserted  into  z.  For 
example,  InsertoEqual({l,2},{3,4},{5,6})  »  {False,5,6}.  The  first  two  arguments  are  not  equal, 
so  the  atom  ‘False’  was  inserted  into  the  third.  Although  hitherto  "unknown",  this  operation 
would  clearly  be  better  off  left  in  that  state. 

Another  kind  of  loser  occurred  whenever  AM  entered  upon  some  "regular"  behavior.  For 
example,  if  it  decided  that  Compose  was  interesting,  it  might  try  to  create  some  examples  of 
compositions.  It  could  do  this  by  picking  two  operations  and  composing  them.  What  better 
operations  to  pick  than  Compose  and  Compose!  Thus  ComposeoCompose  would  be  born. 
By  composing  that  with  itself,  an  even  more  monstrous  operation  is  spawned: 
ComposeoComposeoComposeoCompose.  Since  AM  actually  uses  the  word  "Compose" 
instead  of  that  little  Infix  circle,  the  PNAME  of  the  data  structure  it  creates  is  horrendous. 
Its  use  is  almost  nonexistent:  it  must  take  5  operations  as  arguments,  and  it  returns  a  new 
operation  which  is  the  composition  of  those  five.  An  analogous  danger  which  exists  is  for 
AM  to  be  content  conjecturing  a  stream  of  very  similar  relationships  (e.g.,  the  multiplication 
table).  In  all  such  cases,  AM  must  have  meta-rules  which  pull  it  up  out  of  such  whirlpools, 
to  perceive  a  higher  generalization  of  its  previous  sequence  of  related  activities. 

In  summary,  then,  we  may  say  that  AM  produced  a  few  winning  ideas  new  to  the  author,  a 
couple  of  which  were  new  to  Mankind.  Several  additional  "new"  concepts  were  created 


a 

rtbl*  that  thata  art  "ultimata  ditcovariaa”  only  in  tha  aanaa  of  what  hat  btan  done  at  the  time  of  writing  thie  theaie  For 
one  of  AM'a  ideaa  to  be  "new",  it  ahould  be  pravioualy  unknown  to  both  the  author  and  tha  uaer.  Why?  If  the 
author  knaw  about  it,  then  the  hauriatka  ha  provided  AM  with  might  unconacioualy  encode  a  path  to  that 
knowladga  If  the  uaar  knaw  about  that  idea,  hia  guidance  might  unconacioualy  help  AM  to  derive  it.  An  even 
more  atringent  interpretation  would  be  that  tha  idea  be  hitherto  unknown  to  tha  collective  written  record  of 
Mathamatka. 


^  eg,  d(12)  •  Siza({l,2,3,A,6,l2})  •  6. 

®  Included  are  0°  and  1°,  aa  well  aa  the  "typical"  primea  2®,  3®,  5®,  7®,  II®,.,  179®. 
^  Actually,  in  LISP,  it  waa  eaaier  to  have  auch  raaulta  alwaya  be  either  T  or  NIL 
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which  both  AM  and  the  user  agreed  were  better  forgotten.  The  "level"  of  AM’s  fruits  could 
be  classified  as  an  undergraduate  math  major,  although  this  is  deceptive  since  AM  lacks  the 
breadth  of  abilities  any  human  being  possesses. 


7.1.2.  The  Magnitude  of  AM’s  Progress 


Even  with  men  of  genius,  with  whom  the  birth  rate  of  hypotheses  is  very  high,  it 
only  just  manages  to  exceed  the  death  rate. 


-  W.  H.  Georfe® 


We  can  ask  the  following  kind  of  question:  how  many  "levels"  did  AM  progress  along?  This 
is  a  fuzzy  notion,  but  basically  we  shall  say  that  a  new  level  is  reached  when  a  valuable  new 
bunch  of  connected  concepts  are  defined  in  terms  of  concepts  at  a  lower  level. 

For  example,  AM  started  out  knowing  about  Sets  and  Set-operations.  When  it  progressed 
to  numbers  and  arithmetic,  that  was  one  big  step  up  to  a  new  level.  When  it  zeroed  in  on 
primes,  unique-factorization,  and  divisibility,  it  had  moved  up  another  level. 

When  fed  simple  geometry  concepts,  AM  moved  up  one  level  when  it  defined  some 
generalizations  of  the  equality  of  geometric  figures  (parallel  lines,  congruent  and  similar 
triangles,  angles  equal  in  measure)  and  their  invariants  (rotations,  translations,  reflections). 

The  above  few  examples  are  unfortunately  exhaustive;  that  Just  about  sums  up  the  major 
advances  AM  made.  Its  progress  was  halted  not  so  much  by  cpu  time  and  (per*  as  by  a 
paucity  of  meta-knowledge:  heuristic  rules  for  filling  in  iicw  heuristic  rules.  Thus  AM’s 
successes  are  finite,  and  its  r^ilurcS  infinite,  along  this  dimension. 

A  more  charitable  view  might  compare  AM  to  a  human  who  was  forced  to  start  from  set 
theory,  with  AM’s  sparse  abilities.  In  that  sense,  perhaps,  AM  would  rate  quite  well.  The 
"unfair"  advantage  it  had  was  the  presence  of  many  heuristics  which  themselves  were 
gleaned  from  mathematicians;  i.e.,  they  are  like  compiled  hindsight.  A  major  purpose  of 
mathematics  education  in  the  university  is  to  instil  these  heuristics  into  the  minds  of  the 
students. 

AM  is  thus  characterized  as  possessing  heuristics  which  are  powerful  enough  to  take  it  a 
few  "levels"  away  from  the  kind  of  knowledge  it  began  with,  but  only  a  few  levels.  The 
limiting  factors  are  (i)  the  heuristic  rules  AM  begins  with,  and  more  specifically  (ii)  the 
expertise  in  recognizing  and  compiling  new  heuristics,  and  more  generally  (iii)  a  lack  of 
real-world  situations  to  draw  upon  for  analogies,  intuitions,  and  applications. 


^  Quoted  from  (Bovoridfi  SO], 
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7.1.3.  The  Quality  of  AM’s  Route 


T hinking  is  not  measured  by  what  is  produced,  but  rather  is  a  property  of  the 
way  something  is  done. 


—  Hamming 


No  .natter  what  its  achievements  were,  or  the  magnitude  of  its  advancement  from  initial 
knowledge,  AM  could®  still  be  judged  "unintelligent"  if,  e.g.,  it  were  exploring  vast  numbers 
of  absurd  avenues  for  each  worthwhile  one  it  found.  The  quality  of  the  route  AM  followed 
is  thus  quite  significant. 

AM  performed  better  in  this  respect  than  expected.  It  is  not  obvious'®  how  well  a  human 
would  have  fared  under  similar  circumstances.  Of  the  two  hundred  new  concepts  it  defined, 
about  130  were  acceptable  -  in  the  sense  that  one  can  defend  AM’s  reasoning  in  at  least 
exploring  them;  in  the  sense  that  a  human  mathematician  might  have  considered  them.  Of 
these  "winners",  about  two  dozen  were  significant  -  that  is,  useful,  catalytic,  well-known  by 
human  mathematicians,  etc.  Unfortunately,  the  sixty  or  seventy  concepts  which  were  losers 
were  real  losers.  In  this  respect,  AM  fell  far  below  the  standards  a  mathematician  would  set 
for  acceptable  behavior:  all  his  failures  should  have  at  least  seemed  promising  at  the 
beginnjng.  Half  of  AM’s  adventures  were  poorly  grounded,  and  (perhaps  due  to  a  lack  of 
intuition)  AM  bothered  with  concepts  which  were  "obviously"  trivial:  the  set  of  even  primes, 
the  set  of  numbers  with  only  one  divisor,  etc.  The  human  mathematician  would 
momentarily  consider  many  poor  courses  of  action,  whereas  AM  on  the  other  hand 
managed  to  avoid  truly  lunatic  activities  without  even  momentary  consideration  of  them, 
But  a  fiUi.ian  v-’r-ild  only  spend  a  significant  amount  of  time  on  very  promising  tasks,  and 
AM  wasted  a  huge  amount  oi  time  cn  tasks  which  a  human  would  have  quickly  recognized 
as  dead-ends. 

Once  again  we  must  observe  that  the  quality  of  the  route  is  a  function  of  the  quaiiiy  of  the 
heuristics.  If  there  are  many  clever  little  rules,  then  the  steps  AM  takes  will  often  seem 
clever  and  sophisticated.  If  the  rules  superimpose  nicely.  Joining  together  to  collectively 
buttress  some  specific  activity,  then  their  effectiveness  may  surprise  -  and  surpass  —  their 
creator. 

Such  moments  of  great  insight  (i.e.,  where  AM’s  reasoning  surpassed  mine)  did  occur, 
although  rarely.  Both  of  AM’s  "big  discoveries"  started  by  its  examining  concepts  I  felt 
weren’t  really  interesting.  For  example,  I  didn’t  like  AM  spending  so  much  time  worrying 
about  numbers  with  many  divisors;  I  "knew"  t*-  ‘  the  converse  concept  of  primes  was 


g 

not  nocitttrily  WOULD  be  to  judged.  Humane  may  very  well  contider  an  incredible  number  of  eilly  ideae  before  the  right 
pair  of  "hooked  atome"  collide  into  a  eentible  thought,  which  ie  then  contidered  in  full  coneciouenete  If,  like 
humans,  AU  was  capable  of  doing  this  processing  in  a  sufficiently  brief  period  of  real  time,  it  would  not 
reflect  ill  on  He  evaluation  Of  course,  this  may  simply  be  the  DEFINITION  of  "sufficiently  brief". 

'®  Or  whether  that  even  makes  sense  to  consider.  Comparisons  with  msthematicians  would  be  desirable,  but  are  beyond  the 
ecope  of  this  investigation. 
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infinitely  more  valuable.  And  yet  AM  saw  no  reason  to  give  up  on  max imally-di visible 
numbers;  it  had  several  good  reasons  for  continuing  that  inquiry  (they  were  the  converse  to 
primes  which  had  already  proved  interesting,  their  frequency  within  the  integers  was 
neither  very  high  nor  very  low  nor  very  regular,  their  definition  was  simple,  they  were 
extremals  of  the  interesting  operation  "Divisors-of",  etc.,  etc.)  Similarly,  I  "knew"  that 
Goldbach’s  conjecture  was  useless,  so  1  was  unhappy  that  AM  was  bothering  to  try  to  apply 
it  in  the  domain  of  geometry.  In  both  cases,  AM’s  reasons  for  its  actions  were  unassailable, 
and  in  fact  it  did  discover  some  interesting  new  ideas  both  times. 


Sometimes  AM’s  behavior  was  displeasing,  even  though  It  wasn’t  "erring".  Occasionally  It 
was  simultaneously  developing  two  mini-theories  (say  primes  and  maximally-divisibles). 
Then  it  might  pick  a  task  or  two  dealing  with  one  of  these  topics,  then  a  task  or  two  dealing 
with  the  other  topic,  etc.  The  task  picked  at  each  moment  would  be  the  one  with  the  highest 
priority  value.  As  a  theory  is  developed,  the  intcrestingness  of  its  associated  tasks  go  up 
and  down;  there  may  be  doldrums  for  a  .hit,  just  before  falling  into  the  track  that  will  lead 
to  the  discovery  of  a  valuable  relationship.  During  these  temporary  lags,  the  interest  value 
of  tasks  related  to  the  other  theory’s  concepts  will  appear  to  have  a  higher  priority  value: 
i.e.,  better  reasons  supporting  it.  So  AM  would  then  skip  over  to  one  of  those  concepts, 
develop  it  until  its  doldrums,  then  return  to  the  first  one,  etc.  Most  humans  found  this 
behavior  unpalatable"  because  AM  had  no  compunction  about  skipping  from  one  topic  to 
another.  Humans  have  to  retune  their  minds  to  do  this  skipping,  and  therefore  treat  it 
much  more  seriously.  For  that  reason,  AM  was  given  an  extra  mobile  reason  to  use  for 
certain  tasks  on  its  agenda:  "focus  of  attention".  Any  task  with  the  same  kind  of  topic  as  the 
ones  just  executed  are  given  this  extra  reason,  and  it  raises  their  priority  values  a  little. 
This  was  enough  sometimes  to  keep  AM  working  on  a  certain  mini-theory  when  it 
otherwise  would  have  skipped  somewhere  else. 

The  above  "defect"  is  a  cute  little  kind  of  behavior  AM  exhibited  which  was  non-human 
but  not  clearly  "wrong".  There  were  genuine  bad  moments  also,  of  course.  For  example, 
AM  became  very  excited'^  when  the  conjunction  of  "empty-set"  and  other  concepts  kept 
being  equal  to  empty-set.  AM  kept  repeating  conjunctions  of  this  form,  rather  than  stepping 
back  and  generalizing  this  data  into  a  (phenomenological)  conjecture.  Similar  blind  looping 
behavior  occurred  when  AM  kept  composing  Compose  with  itself,  over  and  over.  In 
general,  one  could  say  that  "regular"  behavior  of  any  kind  signals  a  probable  fiasco.  A 
heuristic  rule  to  this  effect  halted  most  of  these  disgraceful  antics.  This  rule  had  to  be 
careful,  since  it  was  almost  the  antithesis  of  the  "focus  of  attention"  idea  mentioned  in  the 
preceding  paragraph.  Together,  those  two  rules  seem  to  say  that  you  should  continue  on 
with  the  kind  of  thing  you  were  just  doing,  but  not  for  too  long  a  time. 

The  moments  of  insight  were  2  in  number;  the  moments  of  stupid  misdirection  were  about 
twenty  times  as  many. 

AM  has  very  few  heuristics  for  deciding  that  something  was  uninteresting,  that  work  on  it 


*  *  Althouih  il  might  b«  lh«  from  «  dyntmic  m«n«g«m«nt  point  of  vww,  it  probibly  would  bo  wrong  in  tho  long  run. 
Mojor  odvincoo  roilly  do  hivo  kill*  in  thoir  dovolopmont. 

1 2 

Plot**  oxcu**  thi*  anthropomorphitm.  Ttchnically,  w*  may  tay  that  th*  priority  valu*  of  th*  b*«t  job  on  tho  agonda  i* 
tho  "toval  of  oxcitamont"  of  AM.  700  or  hightr  ia  called  "axeitamont",  on  a  acalo  of  0-1000. 
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should  halt  for  a  long  time.  Rather,  AM  simply  won’t  have  anything  positive  to  say  about 
that  concept,  and  other  concepts  are  explored  instead,  essentially  by  default.  Each  concept 
has  a  worth  component  which  corresponded  to  its  right  to  life  (its  right  to  occupy  storage  in 
core).  This  number  slowly  declines  with  time,  and  is  raised  whenever  something  interesting 
happens  with  that  concept.  If  it  ever  falls  below  a  certain  threshhold,  and  if  space  is 
exhausted'^,  then  the  concept  is  forgotten:  its  list  cells  are  garbage  collected,  and  all 
references  to  it  are  erased,  save  those  which  will  keep  it  from  being  re-created.  This  again 
is  not  purposeful  forgetting,  but  rather  by  default;  not  because  X  is  seen  as  a  dead-end,  but 
simply  because  other  concepts  seem  .<o  much  more  interesting  for  a  long  time. 

Thus  AM  did  not  develop  the  sixty  "losers"  very  much:  they  ended  up  with  an  average  of 
only  1.5  tasks  relevant  to  them  ever  having  been  chosen.  The  "winners"  averaged  about 
twice  as  many  tasks  which  helped  fill  them  out  more.  Also,  the  worth  ratings  of  the  losers 
were  far  below  those  of  the  winners.  So  AM  really  did  judge  the  value  of  its  new  concepts 
quite  well. 

The  final  aspect  of  this  important  dimension  of  evaluation  is  the  quality  of  the  reasons  AM 
used  to  support  each  task  it  chose  to  work  on.  Again,  the  English  phrases  corresponded 
quite  nicely  to  the  "real"  reasons  a  human  might  give  to  Justify  why  something  was  worth 
trying,  and  the  ordering  of  the  tasks  on  the  agenda  was  rarely  far  off  from  the  one  that  I 
would  have  picked  myself.  This  was  perhaps  AM’s  greatest  success:  the  rationality  of  its 
actions. 


7.1.4.  The  Character  of  the  User-System  Interactions 

AM  is  not  a  "user-orienteo"  system  There  were  many  nice  human-interaction  features  in 
the  original  grandiose  proposal  for  AM  which  never  got  off  the  drawing  board.  At  the 
heart  of  these  features  were  two  assumptions: 

1.  The  user  must  understand  AM,  and  AM  must  likewise  have  a  good  model  of  the 

particular  human  using  AM.  The  only  time  either  should  initiate  a  message  is 
when  his  model  of  the  other  is  not  what  he  wants  that  model  to  be.  In  that  case, 
the  message  should  be  specifically  designed  to  fix  that  discrepancy..’^ 

2.  Each  kind  of  message  which  is  to  pass  between  AM  and  its  user  should  have  its 
own  appropriate  language.  Thus  there  should  be  a  terse  comment  language, 
whereby  the  user  can  note  how  he  feels  about  what  AM  is  doing,  a  questioning 
language  for  either  party  to  get/give  reasons  to  the  other,  a  picture  language  for 
communicating  certain  relationships,  etc. 

Neither  of  these  ideas  ever  made  it  into  the  LISP  code  that  is  now  AM,  although  they  are 
certainly  not  prohibited  in  any  way  by  AM’s  design.  It  would  be  a  separate  project,  at  or 
above  the  level  of  a  master’s  thesis,  for  someone  to  build  a  nice  user  interface  for  AM'^ 

13 

No  concopit  wtr«  forgclltn  in  thi>  wiy  until  n«tr  th«  ond  of  AM'*  runt,  whin  AM  would  utuilly  collapii  from  iiviral 
cauaaa  including  lack  of  tpaci. 

1 4 

Thii  idia  waa  motivatid  by  a  licturi  givin  in  1975  by  Tarry  Winograd 

15 

I  am  not  actually  calling  for  thia  to  bi  doni,  manly  indicating  tha  magnituda  of  tha  affort  involvad.  A  VERY  nica  uaar 
Intarfaca  might  ba  much  hardar,  at  tha  laval  of  a  diaaartation. 
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As  one  might  expect,  the  reason  for  this  atrophy  is  simply  because  very  little  guidance  from 
the  user  was  needed  by  AM.  In  fact,  ail  the  discoveries,  epu  time  quotes,  etc.  mentioned  in 
this  document  are  taken  from  totally  unguided  runs  by  AM.  If  the  user  guides  as  well  as  he 
can,  then  about  a  factor  of  2  or  3  speedup  Is  possible.  Of  course,  this  assumes  that  the  user 
is  dragging  AM  directly  along  a  line  of  development  he  knows  will  be  successful.  The  user’s 
"reasons"  at  each  step  are  based  essentially  on  hindsight.  Thus  this  is  not  at  all  "fair".  If 
AM  ever  becomes  more  user-oriented,  it  would  be  nice  to  let  children  (say  6-12  years  old) 
experiment  with  it,  to  observe  them  working  with  it  in  domains  unfamiliar  to  either  of 
them.’® 

The  user  can  "kick"  AM  in  one  direction  or  another,  e.g.,  by  interrupting  and  telling  AM 
that  Sets  are  more  interesting  than  Numbers’^  Even  in  that  particular  case,  AM  fails  to 
develop  any  higher-level  set  concepts  (diagonalization,  infinite  sets,  etc.)  and  simply  wallows 
around  in  finite  set  theory  (de  Morgan’s  laws,  associativity  of  Union,  etc.).  When  geometric 
concepts  are  Input,  and  AM  is  kicked  in  that  direction,  much  nicer  results  are  obtained.  See 
the  report  on  the  Geometry  experiment,  page  133. 

There  is  one  important  result  to  observe:  the  very  best  examples  of  AM  in  action  were 
brought  to  full  fruition  only  by  a  human  developer.  That  is,  AM  thought  of  a  couple  great 
concepts,  but  couldn’t  develop  them  well  on  its  own.  A  human  (the  author)  then  took  them 
and  worked  on  them  by  hand,  and  interesting  results  were  achieved.  These  results  could  be 
told  to  AM,  who  could  then  go  off  and  look  for  new  concepts  to  investigate.  This 
interaction  is  of  course  at  a  much  lower  frequency  than  the  kind  of  rapidfire 
question/answering  talked  about  above.  Yet  it  seems  that  such  synergy  may  be  the  ultimate 
mode  of  AM-like  systems. 


7.1.5.  AM’s  Intuitive  Powers 


Intuitive  conviction  surpasses  logic  as  the  brilliance  of  the  sun  surpasses  the  pale 
light  of  the  moon. 

—  Kline 


Let  me  hasten  to  mention  that  the  word  "intuitive"  in  this  subsection’s  title  is  not  related  to 
the  (currently  non-existent)  "Intuitions"  facets  of  the  concepts.  What  is  meant  is  the  totality 
of  plausible  reasoning  which  AM  engages  in:  empirical  induction,  generalization, 
specialization,  maintaining  reasons  for  jobs  on  the  agenda  list,  creation  of  analogies  between 
bunches  of  concepts,  etc. 


®  Sttrrtd  (•)  ixtrci$«  for  th*  rtjdtr-  carry  ovt  tueh  a  project  on  a  atatiatically  significant  tampit  of  children,  wait  thirty 
yeart,  and  obterve  the  incidence  of  mathematicians  and  aeientiata  in  general,  compared  to  the  national 
averagee.  Within  whatever  occupation  they’ve  chosen,  rate  their  creativity  and  productivity. 

’  ^  To  actually  do  this,  the  user  will  type  control-1  to  interrupt  AM.  He  then  types  I,  meaning  "alter  the  interest  of",  followed 
by  the  word  "Sets"  AM  then  asks  whether  this  is  to  be  raised  or  lowered.  He  types  back  R,  and  Akf  askt 
how  much,  on  a  I- 10  scale.  He  replies  9,  say,  and  then  repeats  this  process  for  the  concept  "Numbers". 
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AM  only  considers  conjectures  which  have  been  explicitly  suggested;  cither  by  empirical 
evidence,  by  analogy,  or  (de-implemented  now:)  by  Intuition  facets.  Once  a  conjecture  has 
been  formulated,  it  is  tested  in  all  ways  possible:  new  experimental  evidence  is  sought 
(especially  extreme  cases),  it  is  examined  formally'*  to  see  if  it  follows  from  already-known 
conjectures,  etc. 

Because  of  this  grounding  in  plausibility,  the  only  conjectures  the  user  ever  sees  (the  ones 
AM  is  testing)  are  quite  believable.  If  they  turn  out  to  be  false,  both  he  and  AM  are 
surprised.  For  example,  both  AM  and  the  user  were  disappointed  when  nothing  came  out 
of  the  concept  of  Uniquely-prime-addable  numbers  (positive  integers  which  can  be 
represented  as  the  sum  of  two  primes  in  precisely  one  way).  Several  conjectures  were 
proposed  via  analogy  with  unique  prime  factorization,  but  none  of  them  held 
experimentally.  Each  of  them  seemed  worth  investigating,  to  both  the  user  and  the 
system.'* 

AM's  estimates  of  the  value  of  each  task  it  attempts  were  often  far  off  from  what  hindsight 
proved  their  true  values  to  be.  Yet  this  was  not  so  different  from  the  situation  a  real 
researcher  faces,  and  it  made  little  difference  on  the  discoveries  and  failures  of  the  system. 
AM  occasionally  mismanaged  its  resources  due  to  errors  in  these  estimates.  To  correct  for 
such  erroneous  prejudgments,  heuristic  rules  were  permitted  to  dynamically  alter  the 
time/space  quanta  for  the  current  task.  If  some  interesting  new  result  turned  up,  then  some 
extra  resources  would  be  allotted.  If  certain  heuristics  failed,  they  could  reduce  the  time 
limits,  so  not  as  much  total  epu  time  would  be  wasted  on  this  loser. 

An  example  of  a  nice  conjecture  is  the  unique  factorization  one.  A  nice  analogy  was  the 
one  between  angles  and  numbers  (leading  to  the  application  of  Goldbach’s  conjecture). 
Another  nice  analogy  was  between  numbers  and  bags  (and  hence  between  bag-operations 
and  what  we  commonly  call  arithmetic  operations). 

Some  poor  analogies  were  considered,  like  the  one  between  bags  and  singleton-bags.  The 
ramifications  of  this  analogy  were  painfully  trivial^*. 


7.1.6.  Experiments  on  AM 

The  experiments  described  in  Section  6.2  (page  125  ff)  provide  some  results  relevant  to  the 
overall  value  of  the  AM  system.  The  reader  should  consult  that  section  for  details;  neither 
the  experiments  nor  their  results  will  be  repeated  here.  A  few  conclusions  will  be 
summarized,  to  show  that  AM  fared  well  in  this  dimension  of  evaluation. 

The  worth-numbering  scheme  for  the  concepts  is  fairly  robust:  even  when  all  the  concepts’s 


1 8 

Currantly,  thi*  n  don*  in  trivial  wayt  An  op«n  probitm,  which  i*  under  attack  now,  i*  to  add  mor*  powerful  formal 
reatonint  abilitiea  to  AM 

in 

'  It  i*  etill  not  known  whether  there  it  anything  intereating  about  that  concept  or  not 

?0 

'  The  bag-operation*,  applied  to  emgletona,  did  not  produce  eingletona  at  their  reault  <x)U(y)  it  (x,y)  which  la  not  a 
eingleton  Whether  they  did  or  not  depended  only  on  the  equality  or  inequality  of  the  two  argument*  There 
were  many  tiny  conjecture*  propoitd  which  merely  re-echoed  thi*  general  conclueion 
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worths  are  initialized  at  the  same  value,  the  performance  of  AM  doesn’t  collapse,  although 
ic  is  noticeably  degraded. 

Certain  mutilations  of  the  priority-value  scheme  for  tasks  on  the  agenda  will  cripple  AM, 
but  it  can  resist  most  of  the  small  changes  tried  in  various  experiments. 

Sometimes,  removing  just  a  single  concepts  (e.g.,  Equality)  was  enough  to  block  AM  from 
discovering  some  valuable  concepts  it  otherwise  got  (in  this  case.  Numbers).  This  makes 
Akf’s  behavior  sound  very  fragile,  like  a  slender  chain  of  advancement.  But  on  the  other 
hand,  many  concepts  (e.g.,  TIMES,  Timberline,  Primes^')  were  discovered  in  several 
independent  ways.  If  AM’s  behavior  is  a  chain,  it  is  multiply-stranded^^.  More 
experiments  of  this  sort  should  be  done  to  test  this  general  conclusion  about  AM. 

The  heuristics  are  specific  to  their  stated  domain  of  applicability.  Thus  when  working  in 
geometry,  the  Operation  heuristics  were  Just  as  useful  as  they  were  when  AM  worked  in 
elementary  set  theory  or  number  theory.  The  set  of  facets  seemed  adequate  for  those 
domains,  too.  The  Intuition  facet,  which  was  rejected  as  a  valid  source  of  information  about 
sets  and  numbeis,  might  have  been  more  acceptable  in  geometry  (e.g.,  something  similar  to 
Gelerntcr’s  model  of  a  geometric  situation). 

All  in  all,  then,  we  conclude  that  AM  was  fairly  tough,  and  about  as  general  as  its  heuristics 
claimed  it  was.  AM  is  not  invincible,  infallible,  or  universal.  Its  strength  lies  in  careful  use 
of  heuristics.  If  there  aren’t  enough  domain-specific  heuristics  around,  the  system  will  simply 
not  perform  well  In  that  domain.  If  the  heuristic-using  control  structure  of  AM  is  tampered 
with*^  there  is  some  chance  of  losing  vital  guiding  information  which  the  heuristics  would 
otherwise  supply. 


7.1.7.  How  to  Perform  Experiments  on  AM 

The  very  fact  that  the  kinds  ot  experiments  mentioned  in  the  l?rt  section  (and  described  in 
detail  in  Section  6.2)  can  be  "set  up"  and  performed  on  AM,  reflects  a  nice  quality  of  the 
AM  program. 

Most  of  those  experiments  took  only  a  matter  of  minutes  to  set  up,  only  a  few  tiny 
modifications  to  AM.  For  example,  the  one  where  all  the  Worth  ratings  were  initialized  to 
the  same  value  was  done  by  evaluating  the  single  LISP  expression: 

(MAPC  CONCEPTS  '(X  (e)  (PUT  e  'Worth  200))) 


21 

Primat  wt»  diteovtrad  indapand.-ntly  i$  followt:  ill  numbtrt  (>0)  w«ra  tttn  to  b«  rtprottnttbit  «•  tha  turn  of  amollar 
numbar*,  Add  wit  known  to  ba  tnilo|ou»  to  TIMES:  But  not  ill  numbari  (>l)  ippairid  to  bo  rapraiantibla 
11  tha  product  of  two  imillar  onai;  Rula  numbar  81  trigiarad  (taa  Appandix  3,  piga  243),  and  AM 
dafinad  tha  aat  of  axcaptioni:  tha  aat  of  numbari  which  could  not  ba  axpritiad  ai  tha  product  of  two 
amillar  onai:  ia,  tha  primal. 

22 

axcapt  for  i  faw  walk  ipoti,  lika  Numbara.  If  thay  don't  gat  diicovarad,  AM  loiai 
23 

a  g.,  trait  til  raatont  la  aquivilant,  to  ''ou  juit  COUNT  tha  numbar  of  raiiona  •  taik  hit,  to  datarmina  ita  priority  on 
tha  tgandi. 
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Similarly,  here  is  how  AM  was  modified  to  treat  all  tasks  as  if  they  had  equal  value:  the 
function  Pick-task  has  a  statement  of  the  form 

(SETQ  Currenl-task  (First-msmbsr-of  Agenda)) 

All  that  was  necessary  was  to  replace  the  call  on  the  function  "Firsl-mernber-of"^^  by  the 
function  "Random-member-of". 

Even  the  most  sophisticated  experiment,  the  introduction  of  a  new  bunch  of  concepts  - 
those  dealing  with  geometric  notions  like  Between,  Angle,  Line  -  took  only  a  day  of 
conscious  work  to  set  up. 

Of  course  running  the  experiment  involves  the  expenditure  of  hours  of  epu  time,  so  only  a 
limited  number  were  actually  performed.^® 

There  are  certain  experiments  one  can’t  easily  perform  on  AM:  removing  all  its  heuristics, 
for  example.  Most  heuristic  search  programs  would  then  wallow  around,  displaying  just 
how  big  their  search  space  really  was.  But  AM  would  just  sit  there,  since  it’d  have  nothing 
plausible  to  do. 

Many  other  experiments,  while  cute  and  easy  to  set  up,  are  quite  costly  in  terms  of  epu  time. 
For  example,  the  class  of  experiments  of  the  form:  "remove  heuristics  x,  y,  and  z,  and 
observe  the  resultant  affect  on  AM’s  behavior".  This  observation  would  entail  running  AM 
for  an  hour  or  two  of  epu  time!  Considering  the  number  of  subsets  of  heuristics,  not  all 
these  questions  are  going  to  get  answered  in  our  universe’s  lifetime.  Considering  the  small 
probable  payoff  from  any  one  such  experiment,  very  few  should  actually  be  attempted. 

One  nice  experiment  would  be  to  monitor  the  contribution  ear^-  heuristic  is  making.  That 
is,  record  each  time  it  is  used  and  record  the  final  outcome  of  its  activation  (which  may  be 
several  cycles  later).  Unfortunately,  AM’s  heuristics  are  not  ail  coded  as  separate  Lisp 
entities,  which  one  could  then  "trace".  Rather,  they  are  often  interwoven  with  each  other 
into  large  program  pieces.  So  this  experiment  can’t  be  easily  set  up  and  run  on  AM. 

Most  of  the  experiments  one  could  think  of  can  be  quickly  set  up  -  but  only  by  someone 
familiar  with  the  LISP  code  of  AM.  It  would  be  quite  hard  to  modify  AM  so  that  the 
untrained  user  could  easily  perform  these  experiments.  Essentially,  that  would  demand  that 
AM  have  a  deep  understanding  of  its  own  structure.  This  is  of  course  desirable,  fascinating, 
challenging,  but  wasn’t  part  of  the  design  of  AM.^^ 


In  LISP,  lhi»  function  i»  icluilly  ibbrovitttd  "CAR”. 

OR 

Tho««  dt«crib«d  in  the  lift  chipter.  The  teriet  of  expenmente  begin  it  the  time  time  thit  thii  document  wi«  being 
written,  ind  wii  intended  originilly  only  le  i  dlveriion  from  the  tedium  of  writing  The  intereeting  chiricter 
of  their  riiulti  convinced  me  they  ihouid  be  included,  even  though  they  ire  few  in  number  end  quite 
incomplete 
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7.1.8.  Future  Implications  of  this  Project 

One  harsh  measure  of  AM  would  be  to  demand  what  possible  applications  it  will  have. 

This  really  means  (i)  the  uses  for  the  AM  system,  (ii)  the  uses  for  the  ideas  of  how  to  create 

such  systems,  (iii)  conclusions  about  math  and  science  one  can  draw  from  experiments  with 

AM. 

Here  are  some  of  these  implications,  both  real  and  potential: 

1.  New  tools  for  computer  scientists  who  want  to  create  large  knowledge-based  systems  to 

emulate  some  creative  human  activity. 

la.  The  modular  representation  of  knowledge  that  AM  uses  might  prove  to  be 
effective  in  any  knowledge-based  system.  Division  of  a  global  problem  into  a  multitude 
of  small  chunks,  each  of  them  of  the  form  of  setting  up  one  quite  local  "expert"  on  some 
concept,  is  a  nice  way  to  make  a  hard  task  more  managable.  Conceivably,  each  needed 
expert  could  be  filled  in  by  a  human  who  really  is  an  expert  on  that  topic.  Then  the 
global  abilities  of  the  system  would  be  able  to  rely  on  quite  sophisticated  local  criteria. 
Fixing  a  set  of  facets  once  and  for  all  permits  effective  inter-module  communication. 

lb.  Some  ideas  may  carry  over  unchanged  into  many  fields  of  human  creativity, 
wherever  local  guiding  rules  exist.  These  include:  (a)  ideas  about  heuristics  having 
domains  of  applicability,  (b)  the  policy  of  tacking  them  onto  the  most  general  knowledge 
source  (concept,  module)  they  are  relevant  to,  (c)  the  rippling  scheme  to  locate  relevant 
knowledge,  etc., 

2.  A  body  of  heuristics  which  can  be  built  upon  by  others. 

2a.  Most  of  the  particular  heuristic  judgmental  criteria  for  interestingness,  utility, 
etc.,  might  be  valid  in  developing  theorizers  in  other  sciences.  Recall  that  each  rule  has 
its  domain  of  applicability;  many  of  the  heuristics  in  AM  are  quite  general. 

2b.  Just  within  the  small  domain  In  which  AM  already  works,  this  base  of 
heuristics  might  be  enlarged  through  contact  with  various  mathematicians.  If  they  are 
willing  to  introspect  and  add  some  of  their  "rules"  to  AM’s  existing  base,  it  might 
gradually  grow  more  and  more  powerful. 

2c.  Carrying  this  last  point  to  the  limit  of  possibility,  one  might  imagine  the 
program  possessing  more  heuristics  than  any  single  human.  Of  course,  AM  as  it  stands 
now  is  missing  so  much  of  the  'human  element’,  the  life  experiences  that  a 
_  mathematician  draws  upon  continually  for  inspiration,  that  merely  amassing  more 
heuristics  won’t  automatically  push  It  to  the  level  of  a  super-human  intelligence. 
Another  far-out  scenario  is  that  of  the  great  mathematicians  of  each  generation  pouring 
their  individual  heuristics  into  an  AM-like  system.  After  a  few  generations  have  come 
and  gone,  running  that  program  could  be  a  valuable  way  to  bring  about  ‘interactions* 
between  peopk  who  were  not  contemporaries. 

3.  New  and  better  strategies  for  math  educators,  [optional] 

3a.  Since  the  key  to  AM’s  success  seems  to  be  it'  lieuristics,  and  not  the  particular 
concepts  it  knows,  th^  vhole  orientation  of  ruathematics  education  should  perhaps  be 
modified.  *-  p-nvide  experiences  fc.'  tne  student  which  will  build  up  these  rules  in  his 
mind.  Learning  a  new  theorem  is  worth  much  less  than  learning  a  new  heuristic  which 
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V\n v”. -■’  .  •  V-  ‘  '  ^N  s'H 


Chapter  7  AM:  Discovary  in  Mathematics  a«  Heuriatic  Search  *1^7- 

lets  you  discover  new  theorems.^^  I  am  far  from  the  first  to  urge  such  a  revision  (see, 
e.g.,  [Koestler  67],  p.265,  or  see  [Papert  721). 

3b.  If  the  repertoire  of  intuition  (simulated  real-world  scenarios)  were  sufficient  for 
AM  to  develop  elementary  concepts  of  math,  then  educators  should  ensure  that  children 
(4-6  years  old)  are  thoroughly  exposed  to  those  scenarios.  Such  activities  would  include 
seesaws,  slides,  piling  marbles  into  pans  of  a  balance  scale,  comparing  the  heights  of 
towers  built  out  of  cubical  blocks,  solving  a  jigsaw  puzzle,  etc.  Unfortunately,  AM  failed 
to  show  the  value  of  these  few  scenarios.  This  was  a  potential  application  which  was 
not  confirmed. 

3c.  One  use  for  AM  itself  would  be  as  a  "fun"  teaching  tool.  If  a  very  nice  user 
interface  is  constructed,  AM  could  serve  as  a  model  for,  say,  college  freshmen  with  no 
math  research  experience.  They  could  watch  AM,  see  the  kinds  of  things  it  does,  play 
with  it,  and  perhaps  get  a  real  flavor  for  (and  get  turned  on  by)  doing  math  research.  A 
vast  number  of  brilliant  minds  are  too  turned  off  by  high-school  drilling  and  college 
calculus  to  stick  around  long  enough  to  find  out  how  exciting  -  and  different  -  research 
math  is  compared  to  textbook  math. 

4.  Further  experiments  on  AM  might  tell  us  something  about  how  the  theory  formation  task 

changes  as  a  theory  grows  in  sophistication.  For  example,  can  the  same  methods  which 
lead  AM  from  premathematical  concepts  to  arithmetic  also  lead  AM  from  number 
systems  up  to  abstract  algebra?  Or  are  a  new  set  of  heuristic  rules  or  extra  concepts 
required?  My  guess  is  that  a  few  of  each  are  lacking  currently,  but  only  a  few.  There-  is 
a  great  deal  of  disagreement  about  this  subject  among  mathematicians.  liy  tracing 
along  the  development  of  mathematics,  one  might  categorize  discovciies  by  how  easy 
they  would  be  for  an  AM-like  system  to  find.  Sometimes,  a  discovery  required  the 
invention  of  a  brand  new  heuristic  rule,  which  would  clearly  be  beyond  AM  as 
currently  designed.  Sometimes,  discovery  is  b-ied  on  the  lucky  random  combination  of 
existing  concepts,  for  no  good  a  priori  reason.  It  would  be  instructive  to  find  out  how 
often  this  is  necessarily  the  case,  how  often  can’t  a  mathematical  discovery  be  motivated 
and  "explained"  using  heuristic  rules  of  the  kind  AM  possess^'s? 

5.  An  unsiicicipated  result  was  the  creation  of  new-to-Mankind  math  (both  directly  and  by 

defining  new,  interestin';  concepts  to  investigate  by  hand).  The  amount  of  new  bits  of 
mathematics  developed  to  date  is  minuscule. 

5b.  As  described  in  (2c)  above,  AM  might  absorb  heuristics  from  several 
individuals  and  thereby  integrate  their  particular  insights.  This  might  eventually  result 
in  new  mathematics  being  discovered. 

5b.  An  even  more  exciting  prospect,  which  never  materialized,  was  that  AM 
would  find  a  new  redivision  of  existing  concepts,  an  alternate  formulation  of  some 
established  theory,  much  like  Hamiltonian  mechanics  is  an  alternate  unification  of  the 
data  which  led  to  Newtonian  mechanics.  The  only  rudimentary  behavior  along  these 
lines  was  when  AM  occasionally  derived  a  familiar  concept  in  an  abnormal  way  (e.g., 
TIMES  was  derived  in  four  ways;  Prime  pairs  were  noticed  by  restricting  Addition  to 
primes). 
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7.1.9.  Open  Problems:  Suggestions  for  Future  Research 

While  AM  can  and  should  stand  as  a  complete  research  project,  part  of  its  value  will  stem 
from  whatever  future  studies  are  sparked  by  it.  Of  course  the  "evaluation"  of  AM  alnr.g 
this  dimension  must  wait  for  years,  but  even  at  the  present  time  several  such  open  problems 
come  to  mind: 

•  Devise  Meta-heuristics,  rules  capable  of  operating;  on  and  synthesizing  new  heuristic 
rules.  AM  has  shown  the  solution  chis  problem  to  be  both  nontrivial  and 
indispensable.  AM’s  progres.<  giound  to  a  halt  because  fresh,  powerful  heuristics 
were  never  produced  Tne  next  point  suggests  that  the  same  need  for  new  rules 
exists  in  matberiiatics  as  a  whole: 

•  Examine  the  history  of  mathematics,  and  gradually  build  up  a  list  of  the  heuristic 
rules  used.  Does  the  following  thesis  have  any  validity:  "The  development  of 
mathematics  is  essentially  the  development  of  new  heuristics."  That  is,  can  we  'factor 
out’  all  the  discoveries  reachable  by  the  set  of  heuristics  available  (known)  to  the 
mathematicians  at  some  time  in  history,  and  then  explain  each  new  big  discovery 
as  requiring  the  synthesis  of  a  brand  new  heuristic?  For  example,  Bolyai  and 
Lobachevsky  did  this  a  century  ago  when  they  decided  that  counter-intuitive 
systems  might  still  be  consistent  and  interesting.  Non-Euclidean  geometry  resulted, 
and  no  mathematician  today  would  think  twice  about  using  the  heuristic  they 
developed.  Einstein  invented  a  new  heuristic  more  recently,  when  he  dared  to 
consider  that  counter-intuitive  systems  might  actually  have  physical  reality.^*  What 
was  once  a  bold  new  method  is  now  a  standard  tool  in  theoretical  physics. 

•  In  a  far  less  dramatic  vein,  a  hard  open  problem  is  that  of  building  up  a  body  of 
rules  for  symbolically  instantiating  a  definition  (a  LISP  predicate),  These  rules  moy 
be  structured  hierarchically,  so  that  rules  specific  to  operating  on  ‘operations  whost 
domain  and  range  are  equal’  may  be  gathered.  Is  this  set  finite  and  managable;  i.e., 
does  some  sort  of  "closure"  occur  after  a  few  hundred  (thousand?)  such  rules  are 
assembled? 

•  More  generally,  we  can  ask  for  the  expansion  of  all  the  heuristic  rules,  of  all 
categories.  This  may  be  done  by  eliciting  them  from  famous  mathematicians,  or 
automatically  by  the  application  of  very  sophisticated  meta-heuristics.  Some 
categories  of  rules  include:  how  to  generalize/specialize  definitions,  how  to  find 
examples  of  a  given  concept,  how  to  optimize  LISP  algorithms. 

•  Experiments  can  be  done  on  AM.  A  few  have  been  performed  already,  many  more 
are  proposed  in  Section  6.2,  and  no  doubt  some  additional  ones  have  already 
occurred  to  the  reader. 

•  Extend  the  analysis  already  begun  (see  p.  59)  of  the  set  of  heuristics  AM  possesses. 
One  reason  for  such  an  analysis  would  be  to  achieve  a  better  understanding  of  the 
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contribution  of  the  heuristics.  In  some  sense,  the  heuristics  and  the  choice  of 
starting  concepts  "encode”  the  discoveries  which  AM  makes,  and  the  way  it  makes 
them.  A  better  understanding  of  that  encoding  may  lead  to  new  ideas  for  AM  and 
for  future  AM-like  systems. 

Rewrite  AM.  In  Chapter  1,  on  page  9,  it  was  pointed  out  that  there  are  two 
common  species  of  heuristic  search  programs.  One  type  has  a  legal  move 
generator,  and  heuristics  to  constrain  it.  The  second  type,  including  AM,  has  only 
a  set  of  heuristics,  and  they»act  as  plausible  move  generators.  Since  AM  seemed  to 
create  new  concepts,  propose  new  conjectures,  and  formulate  new  tasks  in  a  very 
few  distinct  ways,  it  might  very  well  be  feasible  to  find  a  purely  syntactic  "legal 
move  generator"  for  AM,  and  to  convert  each  existing  heuristic  into  a  form  of 
constraint.  In  that  case,  one  could,  e.g.,  remove  all  the  heuristics  and  still  see  a 
meaningful  (if  explosive)  activity  proceed.  There  might  be  a  few  surprises  down 
that  path. 

A  more  tractible  project,  a  subset  of  the  former  one,  would  be  to  recode  Just  the 
conjecture-finding  heuristics  as  constraints  on  a  new,  purely  syntactic  "legal 
conjecture  generator".  A  simple  Geiierate-and-Tcst  paradigm  would  be  used  to 
synthesize  and  examine  large  numbers  of  conjectures.  Again,  removing  all  the 
heuristics  would  be  a  worthwhile  experiment. 

At  the  reaches  of  feasability,  one  can  imagine  trying  to  extend  AM  into  more  and 
more  fields,  into  less-formalizable  domains.  International  politics  has  already  been 
suggested  as  a  very  hard  future  applications  area. 

Abstracting  that  last  point,  try  to  build  up  a  set  of  criteria  which  make  a  domain 
ripe  for  automating  (e.g.,  it  possesses  a  strong  theory,  it  is  knowledge-rich  (many 
heuristics  exist),  the  perfr.mance  of  the  professionals/experts  is  much  better  than 
that  of  the  typical  practitioners,  the  new  discoveries  in  that  field  all  fall  into  a  small 
variety  of  syntactic  formats,...?).  Initially,  this  study  might  help  humans  build  better 
and  more  appropriate  scientific  discovery  programs.  Someday,  it  might  even  permit 
the  creation  of  an  automatic-theory-formation-program-writer. 

The  interaction  between  AM  and  the  user  is  minimal  and  painful.  Is  there  a  more 
effective  language  for  communication?  Should  several  languages  exist,  depending 
on  the  type  of  message  to  be  sent  (pictures,  control  characters,  a  subset  of  natural 
language,  induction  from  examples,  etc.)?  Can  AM’s  output  be  raised  in 
sophistication  by  introducing  an  internal  model  of  the  user  and  his  state  of 
knowledge  at  each  moment? 

Human  protocol  studies  may  be  appropriate,  to  test  out  the  model  of  mathematical 
research  which  AM  puts  forward.  Are  the  sequences  of  actions  similar?  Are  the 
mistakes  analogous?  Do  the  pauses  which  the  humans  emit  quantitatively 
correspond  to  AM’s  periods  of  gathering  and  running  'Sugge.st’  heuristics? 

Can  the  idea  of  Intuition  functions  be  developed  into  a  useful  mechanism?  If  not, 
how  else  might  real-world  experiences  be  made  available  to  an  automated 
researcher  to  draw  upon  (for  analogies,  to  base  new  theories  upon)?  Could  one 
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interface  physical  effectors  and  receptors  and  quite  literally  allow  the  program  to 
‘play  around  in  the  real  world’  for  his  analogies? 

•  Most  of  the  'future  implications’  discussed  in  the  last  section  suggest  future  activities 
(e.g.,  new  educational  experiments  and  techniques). 

•  Most  of  the  'limiting  assumptions’  discussed  in  a  later  section  (page  157)  can  be 
tackled  with  today’s  techniques  (plus  a  great  deal  of  effort).  Thus  each  of  them 
counts  as  an  open  problem  for  research.  « 

•  Perform  an  information-theoretic  analysis  on  AM.  What  is  the  value  of  each 
heuristic?  the  new  information  content  of  each  new  conjecture? 

•  If  you’re  interested  in  natural  language,  the  very  hard  problem  exists  of  giving  AM 
(or  a  similar  system)  the  ability  to  really  do  inferential  processing  on  the  reasons 
attached  to  tasks  on  the  agenda.  Instead  of  Just  being  able  to  test  for  equality  of 
two  reasons,  it  would  be  much  more  intelligent  to  be  able  to  infer  the  kind  of 
relationship  between  any  two  reasons;  if  they  overlap  semantically,  we’d  like  to  be 
able  to  compute  precisely  how  that  should  that  effect  the  overall  rating  for  the  task; 
etc. 


•  Modify  the  control  structure  of  AM,  as  follows.  Allow  mini-goals  to  exist,  and 
supply  new  rules  for  setting  them  up  (plausible  goal  generators)  and  altering  those 
goals,  plus  some  new  rules  and  algorithms  for  satisfying  them.  The  modification  I 
have  in  mind  would  result  in  new  tasks  being  proposed  because  of  certain  current 
goals,  and  existing  tasks  would  be  reordered  so  as  to  raise  the  chance  of  satisfying 
some  important  goal.  Finally,  the  human  watching  AM  would  be  able  to  observe 
the  rationality  (hopefully)  of  the  goals  which  were  set.  The  simple  "Focus  of 
Attention"  mechanism  already  in  AM  is  a  tiny  step  in  this  goal-oriented  direction. 
Note  that  this  proposal  itself  demonstrates  that  AM  is  not  inherently  opposed  to  a 
goal-directed  control  structure.  Rather,  AM  simply  possesses  only  a  partial  set  of 
mechanisms  for  complete  reasoning  about  its  domain. 


7.1.10.  Comparison  to  Other  Systems 

One  popular  way  to  judge  a  system  is  to  compare  ii  to  other,  similar  systems,  and/or  to 
others’  proposed  criteria  for  such  systems.  There  is  no  other  project  (known  to  the  author) 
having  the  same  objective:  automated  math  research.^®  Many  somewhat  related  efforts  have 
been  reported  in  the  literature  and  will  be  mentioned  here. 

Several  projects  have  been  undertaken  which  overlap  small  pieces  of  the  AM  system  and  in 
addition  concentrate  deeply  upon  some  area  not  present  in  AM.  For  example,  the  CLET 
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system  [Badre  73]  worked  on  learning  the  decimal  addition  algorithm^®  but  the 
"mathematics  discovery"  aspects  of  that  system  were  neither  emphasized  nor  worth 
emphasizing:  it  was  an  interesting  natural  language  communication  study.  The  same 
comment  applies  to  several  related  studies  by  IMSSS®'. 

Boyer  and  Moore’s  theorem-prover  [Boyer&Moore  75]  embodies  some  of  the  spirit  of  AM 
(e.g.,  generalizing  the  definition  of  a  LISP  function),  but  Its  motivations  are  quite  different, 
its  knowledge  base  is  minimal,  and  its  methods  purely  formal.®^  The  same  comments  apply 
to  the  SAM  program  [Guard  69],  in  which  a  resolution  theorem-prover  is  set  to  work  on 
unsolved  problems  in  lattice  theory. 

Among  the  attempts  to  incorporate  heuristic  knowledge  into  a  theorem  prover,  we  should 
also  mention  [Wang  60],  [Pitrat  70],  [Bledsoe  71],  and  [Brotz  74].  How  did  AM  differ  from 
these  "heuristic  theorem-provers"?  The  goal-driven  control  structure  of  these  systems  is  a 
real  but  only  minor  difference  from  AM’s  control  structure  (e.g.,  AM’s  "focus  of  attention"  is 
a  rudimentary  step  in  that  direction;  see  p.  150).  The  fact  that  their  overall  activity  is 
typically  labelled  as  deductive  is  also  not  a  fundamental  distinction  (since  constructing  a 
proof  is  usually  in  practice  quite  inductive).  Even  the  character  of  the  inference  processes 
are  analogous:  The  provers  typically  contain  a  couple  binary  inference  rules,  like  Modus 
Ponens,  which  are  relatively  risky  to  apply  but  can  yield  big  results;  AM’s  few  "binary" 
operators  have  the  same  characteristics:  Compose,  Canonize,  Logically-combine  (disjoin  and 
conjoin).  The  main  distinction  is  that  the  theorem  provers  each  incorporate  only  a  handful 
of  heuristics.  The  reason  for  this,  in  turn,  is  the  paucity  of  good  heuristics  which  exist  for 
the  very  general  task  environment  in  which  they  operate:  domain-independent  (asemantic) 
predicate  calculus  theorem  proving.  The  need  for  additional  guidance  was  recognized  by 
these  researchers.  For  example,  see  [Wang  60],  p.  3  and  p.  17.  Or  as  Bledsoe  says®®: 

There  is  a  real  difference  between  doing  some  mathematics  and  being  a 
mathematician.  The  difference  is  principally  one  of  judgment:  in  the  selection  of  a 
problem  (theorem  to  be  proved);  in  determining  its  relevance;...  1.’  is  precisely  in 
these  areas  that  machine  provers  have  been  so  lacking.  This  kind  of  judgment  has 
to  be  supplied  by  the  user...  Thus  a  crucial  part  of  the  resolution  proof  is  the 
selection  of  the  reference  theorems  by  the  human  user;  the  human,  by  this  one 
action,  usually  employs  more  skill  than  that  used  by  the  computer  in  the  proof. 

Many  researchers  have  constructed  programs  which  pioneered  some  of  the\echniques  AM 
uses®'’.  [Gelernter  63]  reports  the  use  of  prototypical  examples  as  analogic  models  to  guide 
search  in  geometry,  and  [Bundy  73]  employs  models  of  "sticks"  to  help  his  program  work 
with  natural  numbers.  The  sing'e  heuristic  of  analogy  was  studied  in  [Evans  68]  and 

30 

Giv«n  th«  tddilion  l«bl«  up  to  10  «  10,  pkit  *n  Englith  t«xt  description  of  whst  it  mesnt  to  carry,  how  end  when  to 
carry,  etc ,  actually  write  a  program  capable  of  adding  two  3-digit  numbers 

3 1 

See  [Smith  74a],  for  example. 

32 

This  IS  not  meant  as  criticism:  considering  the  gocls  of  those  researchers,  and  the  age  of  that  system,  their  work  is  quite 
significant. 

®®  [Bledsoe  71J,  p.  73 

®'’  In  many  cases,  those  techniques  were  used  for  the  first  time,  hence  were  thought  of  as  'tricks’. 
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[Kling  7I].35 

Theory  formation  systems  in  any  field  have  been  few.  Meta-Dendral  [Buchanan  74] 
represents  perhaps  the  best  of  these.  Its  task  is  to  unify  a  body  of  mass  spectral  data 
(examples  of  "proper"  identifications  of  spectra)  into  a  small  body  of  rules  for  making 
identifications.  Thus  even  this  system  is  given  a  fixed  task,  a  fixed  set  of  data  to  find 
regularities  within.  AM,  however,  must  find  its  own  data,  and  take  the  responsibility  for 
managing  its  own  time,  for  not  looking  too  long  at  worthless  data.^®  There  has  been  much 
written  about  scientific  theory  formation  (e.g.,  [Hempel  52]),  but  very  little  of  it  is  specific 
enough  to  be  of  immediate  use  to  AI  researchers.  A  couple  pointers  to  excellent  discussions 
of  this  sort  are:  [Fogel  66],  [Simon  73],  and  [Buchanan  76].  Also  worth  noting  is  a 
discussion  near  the  end  of  [Amarel  69],  in  which  "formation"  and  "modelling"  problems  are 
treated: 

The  problem  of  model  finding  is  related  to  the  following  general  question  raised 
by  Schutzenberger  [in  discussion  at  the  Conference  on  Intelligence  and  Intelligent 
Systems,  Athens,  Ga.,  1967]:  'W/iat  do  we  want  to  do  with  intelligent  systems  that 
relates  to  the  work  of  mathematicians?'.  So  far  all  we  have  done  in  this  general 
area  is  to  emulate  some  of  the  reasonably  simple  activities  of  mathematicians, 
which  is  finding  consequences  from  given  assumptions,  reasoning,  proving 
theorems.  A  certain  amount  of  work  of  this  type  was  already  done  in  the 
propositional  and  predicate  calculi,  as  well  as  in  some  other  mathematical  systems. 

But  this  is  only  one  aspect  of  the  work  that  goes  on  in  mathematics. 

Another  very  important  aspect  is  the  one  of  finding  general  properties  of 
structures,  finding  analogies,  similarities,  isomorphisms,  and  so  on.  This  is  the 
type  of  activity  that  is  extremely  important  for  our  understanding  of  model¬ 
finding  mechanisms.  Work  in  this  area  is  more  difficult  than  theorem-proving.  The 
problem  here  is  that  of  theorem  finding. 

AM  is  one  of  the  first  attempts  to  construct  a  "theorem-finding"  program.  As  Amarel  noted, 
it  may  be  possible  to  learn  from  such  programs  how  to  tackle  the  general  task  of  automating 
scientific  research. 

Besides  "math  systems",  and  "creative  thinking  systems",  and  "theory  formation  systems",  we 
should  at  least  discuss  others’  thoughts  on  the  issue  of  algorithmically  doing  math  research. 
Some  individuals  feel  it  is  not  so  far-fetched  to  imagine  automating  mathematical  research 
le.g.,  Paul  Cohen).  Others  (e.g.,  Polya)  would  probably  disagree.  The  presence  of  a  high¬ 
speed,  general-purpose  symbol  manipulator  in  our  midst  now  makes  investigation  of  that 
question  possible. 

There  has  been  very  little  published  thought  about  discovery  in  mathematics  from  an 
algorithmic  point  of  view;  even  clear  thinkers  like  Polya  and  Poincare'  treat  mathematical 
ability  as  a  sacred,  almost  mystic  quality,  tied  to  the  unconscious.  The  v'ritings  of 
philosophers  and  psychologists  invariably  attempt  to  examine  human  performance  and 
belief,  which  are  far  more  managablc  than  creativity  in  vitro.  Belief  formulae  in  inductive 


35 

BtoU'*  protram,  [Srotz  74],  uttt  thia  to  propof*  uaatui  Mmmala 

36 

In  COM  that  watn’t  claar:  Mata-Dandral  hat  a  fixad  tat  of  tamplataa  for  rula-.  which  it  withaa  to  find,  and  a  fixed 
vocabulary  of  matt  apectral  concapta  which  can  ba  plugged  intr  ihote  tamplataa.  AM  alto  haa  only  a  few 
atock  formate  for  conjacturaa,  but  it  aalactivaly  anlargat  ita  vr-.abulary  of  math  concapta 


Ch*pter  7 


AMt  Discovtry  in  Mtthamatic*  m  Heurittic  Starch 


-153- 


logic®^  invariably  fall  back  upon  how  well  they  fit  human  measurements.  The  abilities  of  a 
computer  and  a  brain  are  too  distinct  to  consider  blindly  working  for  results  (let  alone 
algorithms!)  one  possesses  which  match  those  of  the  other. 


7.2.  Capabilities  and  Limitations  of  AM 

The  first  two  subsections  contain  a  general  discussion  of  what  AM  can  and  can’t  do.  Later 
subsections  deal  with  powers  and  limitations  inherent  in  using  an  agenda  scheme,  in  fixing 
the  domain  of  AM,  and  in  picking  one  specific  model  of  math  research  to  build  AM  upon. 
The  AM  program  exists  only  because  a  great  many  simplifying  assumptions  were  tolerated; 
these  are  discussed  in  Section  7.2.4  (p.  157).  Finally,  some  speculation  is  made  about  the 
ultimate  powers  and  weaknesses  of  any  systems  which  are  designed  very  much  like  AM. 


7.2.1.  Current  Abilities 

What  fields  has  AM  worked  in  so  far?  AM  is  now  able  to  explore  a  small  bit  of  the  theory 
of  sets,  data  types,  numbers,  and  plane  geometry.  It  by  no  means  has  been  fed  -  nor  has  it 
rediscovered  -  a  large  fraction  of  what  is  known  in  any  of  those  fields.  It  might  be  more 
accurate  to  be  humble  and  restate  those  domains  as:  elementary  finite  set  theory,  trivial 
observations  about  four  kinds  of  data  types,  arithmetic  and  elementary  divisibility  theory, 
and  simple  relationships  between  lines,  angles,  and  triangles.  So  a  sophisticated  concept  in 
each  domain  -  which  was  discovered  by  AM  -  might  be: 

•  de  Morgan’s  laws 

•  the  fact  that  Deleteolnsert®*  never  alters  Bags  or  Lists 

•  unique  factorization 

•  similar  triangles 

Can  AM  work  in  a  new  field,  like  politics?  AM  can  work  in  a  new  elementary,  formalized 
domain,  if  it  is  fed  a  supplemental  base  of  conceptual  primitives  for  that  domain.  To  work 
in  plane  geometry,  it  sufficed  to  give  AM  about  twenty  new  primitive  concepts,  each  with  a 
few  parts  filled  In.  Another  domain  which  AM  could  work  in  would  be  elementary 
mechanics.  The  more  informal  the  desired  field,  the  less  of  AM  that  is  relevant.  Perhaps  an 
AM-like  system  could  be  built  for  a  constrained,  precise  political  task.®®  Disclaimer:  Even 
for  a  very  small  domain,  the  amount  of  common-sense  knowledge  such  a  system  would  need 
is  staggering.  It  is  unfortunate  to  provide  such  a  trivial  answer  to  such  an  important 
question,  but  there  is  no  easy  way  to  answer  it  more  fully  until  years  of  additional  research 
are  performed. 

Can  AM  discover  X?  Why  didn't  it  do  Y?  It  is  difficult  to  predict  whether  AM  will  (without 


37 


38 
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For  axampl*,  ««t  [Hintikka  62],  [Pittirinin  72]  Th«  latttr  alto  containa  a  (ood  autnmary  of  Carnap’t  X,oC  formalization. 
Takt  an  Htm  x,  inttrt  H  into  (Iht  front  of)  atruclur*  B,  th»n  daitto  ont  <the  firat)  occurrtnet  of  x  from  B 

For  txamplt,  auch  a  politica-oritnttd  AM-lik»  ayattm  might  conctivt  th«  notion  of  a  group  of  political  intitiat  which  view 
themaelvtt  at  quite  ditparate,  but  which  are  viewed  from  the  outtide  at  a  tingle  unit,  eg,  'the  Arabt',  'the 
American  Indiana'  Conjecturet  about  thit  concept  might  include  itt  reputation  at  a  poor  combatant  (and 
why).  Many  of  the  tame  fKete  AM  uttt  would  carry  over  to  repretent  concaptt  in  that  new  domain 
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modifications)  ever  make  a  specific  given  discovery.  Although  its  capabilities  are  small,  its 
limitations  are  hazy.  What  makes  the  matter  even  worse  is  that,  given  a  concept  C  which 
AM  missed  discovering,  there  is  probably  a  reasonable  heuristic  rule  which  is  missing  from 
AM,  which  would  enable  that  discovery.  One  danger  of  this  "debugging"  is  that  a  rule  will 
be  added  which  only  leads  to  that  one  desired  discovery,  and  isn’t  good  for  anything  else.  In 
that  case,  the  new  heuristic  rule  would  simply  be  an  encoding  of  a  specific  bit  of 
mathematics  which  AM  would  then  appear  to  discover  using  general  methods.  This  must 
be  avoided  at  all  costs,  even  at  the  cost  of  intentionally  giving  up  a  certain  discovery.  If  the 
needed  rule  is  general  -  it  has  many  applications  and  leads  to  many  interesting  results  — 
then  It  really  was  an  oversight  not  to  include  it  in  AM.  Although  I  believe  that  there  are 
not  too  many  such  omissions  still  within  the  small  realm  AM  explores,  there  is  no  objective 
way  to  demonstrate  that,  except  by  further  long  tests  with  AM. 

In  what  ways  are  new  concepts  created^  Although  the  answer  to  this  is  accurately  given  in 
Section  4.3,  page  42  (namely,  this  is  mainly  the  Jurisdiction  of  the  right  sides  of  heuristic 
rules),  ?nd  although  1  dislike  the  simple-minded  way  it  makes  AM  sound,  the  list  below 
does  characterize  the  major  ways  in  which  new  concepts  get  born: 

Fill  in  flxamples  of  a  eoncopt  (e.g.,  by  instaniiating  or  running  its  dofinition) 

Croat*  a  gonoralization  of  a  given  concept  (e.g.,  by  weakening  its  definition) 

Create  a  specialization  of  a  given  concept  (e.g.,  by  restricting  its  domain/rang*) 

Compose  two  operations  f,g,  thereby  creating  a  new  one  h.  [Define  h(x)sf(g(x)}] 

Coalesce  an  operation  f  into  a  new  one  g.  [Define  g(x)sf(x,x)] 

Permute  the  order  of  the  arguments  of  an  operation.  [Define  g(x,y)3f(y,x}] 

Invert  an  operation  [g(x)=y  iff  f(y)»x]  (e.g.,  from  Squaring,  create  Square-rooting) 

Canonize  one  predicate  PI  with  respect  to  a  more  general  on*  P2  [create  a  new  concept  f, 
an  operation,  such  that:  P2(x,y)  iff  Pl(f(x),f(y)}j 
Create  a  new  operation  g,  which  is  the  repealed  application  of  an  existing  operation  f. 

The  usual  logical  combinations  of  existing  concepts  x,y:  xAy,  xvy,  ■'X,  etc. 

Below  is  a  similar  list,  giving  the  primary  ways  in  which  AM  formulates  new  conjectures: 
Notice  that  concept  Cl  is  really  an  example  of  concept  C2 
Notice  that  concept  Cl  is  really  a  specialization  (or:  generalization)  of  C2 
Notice  that  Cl  is  equal  to  C2:  or:  almost  always  equal 
Notice  that  Cl  and  C2  are  related  by  some  Known  concept 
Check  and  update  the  domain/range  of  an  existing  operation 
If  two  concepts  are  analogous,  extend  the  analogy  to  their  conjectures  as  well 

In  summary,  we  can  say  that  AM  has  achieved  its  original  purpose;  to  be  guided 
successfully  by  a  large  set  of  local  heuristic  rules,  in  the  discovery  of  new  mathematical 
theories.  Besides  creating  new  concepts  and  noticing  conjectures,  AM  has  the  key  "ability" 
of  appearing  to  decide  rationally  what  to  work  on  at  each  moment.  This  is  a  result  of  the 
agenda  of  tasks  -  containing  associated  reasons.  Of  course  all  of  these  abilities  stem  from 
the  quality  and  the  quantity  of  local  heuristic  rules:  little  plausible  move  generators  and 
evaluators. 


1.  Current  Limitations 


Below  are  several  shortcomings  of  AM,  which  hurt  its  behavior  but  are  not  believed  to  be 
inherent  limitations  of  its  design.  They  are  presented  in  order  of  decreasing  severity. 
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Perhaps  the  most  serious  limitation  on  AM’s  current  behavior  arose  from  the  lack  of 
constraints  on  left  sides  of  heuristic  rules.  It  turned  out  that  this  excessive  freedom  made  it 
difficult  for  AM  to  inspect  and  analyze  and  synthesize  its  own  heuristics;  such  a  need  was 
not  foreseen  at  the  time  AM  was  designed.  It  was  thought  that  the  power  to  manipulate 
heuristic  rules  was  an  ability  which  the  author  must  have,  but  which  the  system  wouldn’t 
require.  As  it  turned  out,  AM  did  successfully  develop  new  concepts  several  levels  deeper 
than  the  ones  it  started  with.  But  as  the  new  concepts  got  further  and  further  away  from 
those  initial  ones,  they  had  fewer  and  fewer  specific  heuristics  filled  in  (since  they  had  to  be 
filled  in  by  AM  itself).  Gradually,  AM  found  itself  relying  on  heuristics  which  were  very 
general  compared  to  the  concepts  it  was  dealing  with  (e.g.,  forced  to  use  heuristics  about 
Objects  when  dealing  with  Numbers).  Heuristics  for  dealing  with  heuristics  do  exist,  and 
their  number  could  be  increased.  This  is  not  an  easy  job;  finding  a  new  meta-heuristic  is  a 
tough  process.  Heuristics  are  rarely  more  than  compiled  hindsight;  hence  it’s  difficult  to 
create  new  ones  "before  the  fact". 

AM  has  no  notion  of  proof,  proof  techniques,  formal  validity,  heuristics  for  finding 
counterexamples,  etc.  Thus  it  never  really  establishes  any  conjecture  formally.  This  could 
probably  be  remedied  by  adding  about  25  new  concepts  (and  their  100  new  associated 
heuristics)  dealing  with  such  topics.  The  needed  concepts  have  been  outlined  on  paper,  but 
nut  yet  coded.  It  would  probably  require  a  few  hundred  hours  to  code  and  debug  them. 

The  user  interface  is  quite  primitive,  and  this  again  could  be  dramatically  improved  with 
just  a  couple  hundred  hours’  work.  AM’s  explanation  system  is  almost  nonexistent;  the  user 
must  ask  a  question  quickly,  or  AM  will  have  already  destroyed  the  information  needed  to 
construct  an  answer.  A  clean  record  of  recent  system  history  and  a  nice  scheme  for  tracking 
down  reasons  for  modifying  old  rules  and  adding  new  ones  dynamically  does  not  exist  at 
the  level  which  is  found,  e.g.,  in  MYCIN  (Davis  76].  There  is  no  trivial  way  to  have  the 
system  print  out  its  heuristics  in  a  format  which  is  intelligible  to  the  untrained  user. 

An  important  type  of  analogy  which  was  untapped  by  AM  was  that  between  heuristics.  If 
two  situations  were  similar,  conceivably  the  heuristics  useful  in  one  situation  might  be 
useful  (or  have  useful  analogues)  in  the  new  situation  (see  [Koppelman  75]).  Perhaps  this 
is  a  viable  way  of  enlarging  the  known  heuristics.  Such  "meta-level"  activities  were  kept  to 
a  minimum  throughout  AM,  and  this  proved  to  be  a  serious  limitation.  My  intuition  tells 
me  that  the  "right"  ten  meta-rules  could  correct  this  particular  deficiency. 

The  idea  of  "Intuitions"  facets  was  a  flop.  Intuitions  were  meant  to  model  reality,  at  least 
little  pieces  of  it,  so  that  AM  could  perform  (simulate)  physical  experiments,  and  observe  the 
results.  The  major  problem  here  was  that  so  little  of  the  world  was  modelled  that  the  only 
relationships  derivable  were  those  foreseen  by  the  author.  This  lack  of  generality  was 
unacceptable,  and  the  intuitions  were  completely  excised.  The  original  idea  might  lead 
somewhere  if  it  were  developed  fully.  As  with  all  limitations  of  AM,  I  leave  this  as  an  open 
suggestion  for  future  research. 

Several  limitations  arose  from  the  constraints  of  the  agenda  scheme,  from  the  choice  of  finite 
set  theory  as  the  domain  to  work  in,  and  from  the  particular  model  of  math  research  that 
was  postulated.  These  will  be  discussed  in  the  next  few  subsections. 
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7.2.3.  Limitations  of  the  Agenda  scheme 

The  following  quibbles  with  the  agenda  scheme  get  less  and  less  important.  When  you  get 
bored,  skip  to  the  next  subsection. 

Cuitently,  it  is  difficult  to  include  heuristics  which  interact  with  one  another  in  any 
significant  way.  The  whole  fibre  of  the  Agenda  scheme  assumes  perfect  independence  of 
heuiistics.  The  global  formula  used  to  rate  tasks  on  the  agenda  assumes  perfect 
supei position  of  reasons:  there  are  no  "cross-terms".  Is  this  assumption  always  valid? 
Unfortunately  no,  not  even  for  the  limited  domain  AM  has  explored.  Sometimes,  two 
reasons  are  very  similar:  "Examples  of  Sets  would  permit  finding  examples  of  Union"  and 
"Examples  of  Sets  would  permit  finding  examples  of  Intersection".  In  that  case,  their  two 
ratings  shouldn’t  cause  such  a  big  increase  in  the  overall  priority  value  of  the  task  "Fillin 
axamplss  of  Sets". 

‘'ometimes,  a  heuristic  rule  will  want  to  dissuade  the  system  from  some  activity.  Thus  a 
ririjr/rrr  numeric  coiiiiibution  to  a  task’s  priority  value  is  desired.  This  is  not  figured  into 
the  ciiiif’nt  scheme.  With  a  slight  modification,  the  global  formula  could  preserve  the  sign 
(signum)  of  each  reason’s  rating. 

I  a^ks  on  the  agenda  list  are  ordered  by  their  numeric  priority  value.  Each  reason’s 
iiumeiu.  value  is  kept,  too.  When  new  reasons  are  added,  these  values  are  used  to 
terompute  a  new  priority  for  the  task.  Each  reason's  rating  was  computed  by  a  little 
foirnula  found  inside  some  heuristic  rule.  Those  formulae  are  not  kept  hanging  around. 
Cm  big  improvement  in  apparent  intelligence  could  be  attained  by  tacking  on  those  little 
formulae  to  the  reasons.  When  a  new  reason  is  added,  the  old  reasons’  rating  formulae 
would  be  evaluated  again.  They  might  indeed  give  new  numbers.  For  example,  suppose 
one  reason  was  "Few  examples  of  X  are  known".  But  by  now,  other  tasks  have  meanwhile 
inadvertantly  filled  in  several  examples  of  X.  Then  that  little  reason’s  formula  would  come 
up  with  a  much  lower  value  than  it  did  originally.  In  fact,  the  value  might  be  so  low  that 
the  reason  was  dropped  altogether.  If  the  formulae  were  kept,  it  might  be  good  practice  to 
evaluate  them  for  the  top  two  or  three  tasks  on  the  agenda,  to  see  if  they  might  change  their 
ordering  Also,  the  top  task’s  priority  would  then  be  more  accurate,  and  recall  that  its  value 
is  used  to  determine  the  cpu  time  and  list  cell  space  quanta  that  the  task  is  allowed  to  use 
up  At  the  moment,  AM  is  not  set  up  to  store  the  little  functions,  and  if  modified  to  do  so,  it 
uses  up  a  lot  more  space  than  it  can  afford.  Also,  the  top  few  jobs  are  almost  never 
semantically  coupled  (except  by  "focus  of  attention"),  so  the  precise  order  in  which  they  are 
executed  rarely  matters. 

Perhaps  what  is  needed  is  not  a  single  priority  value  for  each  task,  but  a  vector  of  numbers. 
At  each  cycle,  AM  would  construct  a  vector  of  its  current  "interests"  and  needs,  and  each 
task’s  vector  would  be  dot-multiplied  against  this  global  vector  of  AM’s  desires.  The 
highest  scorer  would  then  be  chosen.  For  example,  one  dimension  of  the  rating  could  be 
"safety",  and  one  could  be  "best  possible  payoff",  one  could  be  "average  expected  payoff",  etc. 
Sometimes,  AM  would  have  to  break  out  of  a  stagnant  situation,  and  it  would  be  willing  to 
try  riskier  tasks  than  usual.  This  was  not  implemented  because  of  the  great  increase  in  cpu 
time  It  would  cause.  It  is,  however,  probably  a  better  design  than  the  current  one.  Even 
more  intelligent  schemes  can  be  envisioned  -  involving  more  and  more  symbolic  data  being 
stored  with  each  task.  Ultimately,  this  would  be  Just  the  English  reasons  themselves;  by  that 
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time,  the  task-orderer  would  have  grown  into  an  incredibly  complex  AI  program  itself  (a 
natural  language  program  plus  an  interrelator  plus...). 

The  agenda  list  should  really  be  an  agenda  tree^°,  since  the  ordering  of  tasks  is  really  just 
partial,  not  total.  If  this  is  clear,  then  skip  the  rest  of  this  paragraph.  There  are  some 
’’legitimate"  orderings  of  tasks  on  the  agenda;  if  task  X  is  supported  by  a  subset  of  the 
reasons  which  support  Y,  then  typically  the  priority  of  X  will  be  less  than  or  equal  to  the 
priority  of  Y.  Two  tasks  of  the  form  "Fillin  examples  of  A",  "Fill  in  examples  of  B"  can  be 
ordered  simply  because  A  is  currently  much  more  interesting  than  B.  But  often,  two  tasks 
will  have  no  ironclad  ordering  between  them:  compare  "Fillin  examples  of  Sets"  and  "Check 
generalizations  of  Union".  Thus  the  ordering  is  only  partial,  and  it  is  the  artifice  of  the 
global  evaluation  function  which  embeds  this  into  a  linear  ordering.  If  multiprocessors  are 
used,  it  might  be  advantageous  to  keep  the  original  partial  ordering  around. 


7.2.4.  Limiting  Assumptions 

AM  only  "got  off  the  ground"  because  a  number  of  sweeping  assumptions  were  made, 
pertaining  to  what  could  be  ignored,  how  a  complex  process  could  be  adequately  simulated, 
etc.  Now  that  AM  is  running,  however,  those  same  simplifications  crop  up  as  limitations  to 
the  system’s  behavior.  Each  of  the  following  points  is  a  'convenient  falsehood’.  Although 
the  reader  has  already  been  told  about  some  of  these,  it’s  worth  listing  them  ail  together 
here: 

•  The  only  communication  necessary  from  AM  to  the  user  is  keeping  the  user 
informed  of  what  AM  is  doing.  No  natural  language  ability  is  required  by  AM; 
simple  template  instantiation  is  sufficient. 

•  The  only  communication  from  the  user  to  AM  is  an  occasional  interrupt,  when  the 
user  wishes  to  provide  some  guidance  or  to  pose  a  query.  Both  of  these  can  be 
stereotyped  and  passed  easily  through  a  very  narrow  channel.^' 

•  Each  heuristic  has  a  well-defined  domain  of  applicability,  which  can  be  specified 
Just  by  giving  the  name  of  a  single  concept. 

•  If  concept  Cl  is  more  specialized  than  C2,  then  Cl’s  heuristics  will  be  more 
powerful  and  should  be  exccuteo  before  C2’s  (whenever  both  concepts’  heuristics 
are  relevant). 

•  If  h  1  and  h2  are  two  heuristics  attached  to  concept  C,  then  it  is  not  necessary  to 
spend  any  time  ordering  them. 

•  Heuristics  superimpose  perfectly;  they  never  interact  strongly  with  each  other. 
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•  The  reasons  supporting  a  task  can  be  mere  tokens;  it  suffices  to  be  able  to  inspect 
them  for  equality.  They  need  not  follow  a  constrained  syntax.  The  value  of  a 
reason  is  adequately  characterized  by  a  unidimensional  numeric  rating. 

•  The  reasons  supporting  a  task  superimpose  perfectly;  they  n  cV6r  interact  with  each 
other. 

•  Supporting  reasons  -  and  their  ratings  -  never  change  with  time,  with  one 
exception:  the  ephemeron  'Focus  of  attention’. 

•  It  doesn’t  matter  in  what  order  the  supporting  reasons  for  a  task  were  added. 

•  There  is  no  need  for  negative  or  inhibitory  reasons,  which  would  decrease  the 
priority  value  of  a  task. 

•  At  any  moment,  the  top  few  tasks  on  the  agenda  are  not  coupled  strongly;  it  is  not 
necessary  to  expend  extra  processing  time'to  carefully  order  them. 

•  The  tusks  on  the  agenda  are  completely  independent  of  each  other,  in  the  sense  of 
one  task  ‘enabling’  or  ‘waking-up’  another. 

•  Mathematics  research  has  a  clean,  simple  model  (see  Section  7.2.6,  page  162), 
which  indicates  that  it  is  a  search  process  governed  by  a  large  collection  of  heuristic 
rules. 

•  Elementary  mathematics  is  such  that  valuable  new  concepts  will  be  discovered  fairly 
regularly. 

•  The  worth  of  each  new  concept  can  be  estimated  easily,  after  just  a  brief 
investigation. 

•  Contradictions  will  arise  very  rarely,  and  it  is  not  disastrous  to  ignore  them  whe.n 
they  do  occur.  The  same  indifference  applies  to  the  danger  of  believing  in  fa.l'e 
conjectures. 

•  When  doing  theory  formation  in  elementary  mathematics,  proof  and  formal 
reasoning  are  dispensable. 

•  Even  as  more  knowledge  is  obtained,  the  set  of  facets  need  never  change. 

•  For  any  piece  of  knowledge  sought  or  obtained,  there  is  precisely  one  facet  of  one 

existing^^  concept  where  that  knowledge  ought  to  be  stored,  and  it  is  easy  to 
determine  that  proper  location. 

•  Even  as  more  concepts  are  defined,  the  body  of  heuristics  need  not  grow  much. 
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>  Any  common-sense  knowledge  required  by  AM  is  automatically  present  within  the 
heuristic  rules.  So,  e.g.,  no  special  spatial  visualization  abilities  are  needed. 

It  is  worth  repeating  here  that  the  above  assumptions  are  all  clearly /a/r<.  Yet  none  of  them 
was  too  damaging  to  AM’s  behavior,  and  their  combined  presence  made  the  creation  of 
AM  feasible. 


7.2.5.  Choice  of  Domain 


T he  genesis  of  mathematical  creation  is  a  problem  which  should  intensely  interest 
the  psychologist.  It  is  the  activity  in  which  the  human  mind  seems  to  take  least 
from  the  outside  world,  in  which  it  acts  or  seems  to  act  only  of  itself  and  on  itself, 
so  that  in  studying  the  procedure  of  mathematical  thought  we  may  hope  to  reach 
what  is  most  essential  in  man's  mind. 


Poincart’ 


Here  are  some  questions  this  subsection  will  address; 

•  What  are  the  inherent  limitations  -  and  advantages  -  in  fixing  a  domain  for  AM 
to  work  in? 

•  What  characteristics  arc  favorable  to  automating  research  in  any  given  domain? 

•  What  are  the  specific  reasons  for  and  against  elementary  finite  set  theory  as  the 
chosen  starting  domain? 

Research  in  various  domains  of  science  and  math  proceeds  slightly  differently.  For  example, 
psychology  is  interested  in  explaining  people,  not  in  creating  new  kinds  of  people.  Math  is 
not  interested  in  individual  entities  so  much  as  in  new  kinds  of  entities.  There  are  ethical 
restrictions  on  physicians  which  prevent  certain  experiments  from  being  done.  Political 
experiments  rarely  permit  backtracking,  etc.  Each  field  has  its  own  peculiarities. 

If  we  want  a  system  to  work  in  many  domains,  we  have  to  sacrifice  some  power Within  a 
given  field  of  knowledge  (like  math),  the  finer  the  category  we  limit  ourselves  to,  the  more 
specific  are  the  heuristics  which  become  available.  So  it  was  reasonable  to  make  this  first 
attempt  limited  to  one  narrow  domain. 

This  brings  up  the  choice  of  domain.  What  should  it  be?  As  the  DENDRAL  project 
illustrated  so  clearly^^,  choice  of  subject  domain  is  quite  important  when  studying  how 
researchers  discover  and  develop  their  theories.  Mathematics  was  chosen  as  the  domain  of 
this  investigation,  because 

1.  In  doing  math  research,  one  needn’t  cope  with  the  uncertainties  and  fallability  of 
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testing  equipment;  that  is,  there  are  no  uncertainties  in  the  data  (compared  to,  e.g., 
molecular  structure  inference  from  mass  spectrograms). 

2.  Reliance  on  experts’  introspections  is  one  of  the  most  powerful  techniques  for 
codifying  the  judgmental  criteria  necessary  to  do  effective  work  in  a  field;  I 
personally  have  had  enough  training  in  elementary  mathematics  so  that  I  didn’t 
have  to  rely  completely  on  external  sources  for  guidance  in  formulating  such 
heuristic  rules.  Also,  several  excellent  sources  were  available  [Polya,  Skemp, 
Hadamard,  Kershner,  etc.]. 

3.  The  more  formal  a  science  is,  the  easier  it  is  to  automate.  For  a  machine  to  carry  out 

research  in  psychology  would  require  more  knowledge  about  human  information 
processing  than  now  is  known,  because  psychology  deals  with  entities  as  complex  as 
you  and  I.  Also,  in  a  formal  science,  the  {..nguages  to  communicate  information  can 
be  simple  even  though  the  messages  themselves  be  sophisticated. 

4.  Since  mathematics  can  deal  with  any  conceivable  constructs,  a  researcher  there  is  not 

limited  to  explaining  observed  data.  Related  to  this  is  the  freedom  to  investigate  - 
or  to  give  up  on  -  whatever  the  researcher  wants  to.  There  is  no  single  discovery 
which  IS  the  "goal",  no  given  problem  to  solve,  no  right  or  wrong  behavior. 

5.  Unlike  "simpler"  fields,  such  as  propositional  logic,  there  is  an  abundance  of  heuristic 

rules  available  for  the  picking. 

The  limitations  of  math  as  a  domain  are  closely  intertwined  with  its  advantages.  Having 
no  ties  to  real-world  data  can  be  viewed  as  a  limitation,  as  can  having  no  clear  goal.  There 
is  always  the  danger  that  AM  will  give  up  on  each  theory  as  soon  as  the  first  tough  obstacle 
crops  up. 

Since  math  has  been  worked  on  for  millenia  by  some  of  the  greatest  minds  from  many 
different  cultures,  it  is  unlikely  that  a  small  effort  like  AM  would  make  any  new  inroads, 
have  any  startling  insights.  In  that  respect,  Dendral’s  space  was  much  less  explored.  Of 
course  math  -  even  at  the  elementary  level  that  AM  explored  it  -  still  has  undiscovered 
gems  (e.g.,  the  recent  unearthing  of  Conway’s  numbers  [Knuth  74]). 

One  point  of  agreement  between  Weizenbaum  and  Lederberg^®  is  that  A I  can  succeed  in 
automating  an  activity  only  when  a  "strong  theory"  of  that  activity  exists.  AM  is  built  on  a 
detailed  model  of  how  humans  do  math  research.  In  the  next  subsection,  we’ll  discuss  the 
model  of  math  research  that  AM  assumes. 

Before  that,  consider  for  a  moment  how  few  other  fields  of  human  endeavor  have  a  good 
model,  and  also  enjoy  all  the  advantages  listed  above;  other  domains  of  math,  classical 
physics,...  not  many  others. 


7.2.6.  Limitations  of  the  Model  of  Math  Research 
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Weizenbaum  does  point  to  pr(^ects  in  mathematics  and  chemistry  where  computers 
have  shoton  their  potential  for  assisting  human  scientists  in  solving  problems.  He 
correctly  points  out  that  these  successes  are  based  on  the  existence  of  "strong 
theories"  about  their  sut^'ect  matter. 


—  Ltdarbcrt 


AM,  like  anything  else  in  this  world,  is  constrained  by  a  mass  of  assumptions.  Most  of  these 
are  "compiled"  or  interwoven  into  the  very  fabric  of  AM,  hence  can’t  be  tested  by 
experiments  on  AM.  Some  of  these  were  just  discussed  a  few  pages  ago,  in  Section  7.2.4. 

Another  body  of  assumptions  exists.  AM  is  built  around  a  particular  model  of  how 
mathematicians  actually  go  about  doing  their  research.  This  model  was  derived  from 
introspection,  but  can  be  supported  by  quotes  from  Polya,  Kershner,  Hadamard,  Saaty, 
Skemp,  and  many  others.  No  attempt  will  be  made  to  justify  any  of  these  premises.  On  the 
next  page  is  a  simplified  summary  of  that  information  processing  model  for  math  theory 
formation: 
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model  OF  MATH  RESEARCH 


1.  The  order  m  which  a  math  textbook  presents  a  theory  is  almost  the  exact  opposite 

of  the  order  in  which  it  was  actually  discovered  and  developed.  In  a  text,  new 
definitions  are  stated  with  little  or  no  motivation,  and  they  turn  out  to  be  just  the 
ones  needed  to  state  the  next  big  theorem,  whose  proof  then  magically  appears. 
In  contrast,  a  mathematician  doing  research  will  examine  some  already-known 
concepts,  perhaps  trying  to  find  some  regularity  in  experimental  data  involving 
them.  The  patterns  he  notices  are  the  conjectures  he  must  investigate  further, 
and  these  relationships  directly  motivate  him  to  make  new  definitions. 

2.  Each  step  the  researcher  takes  while  developing  a  new  theory  involves  choosing 

from  a  large  set  of  "legal"  alternatives  -  that  is,  searching.  The  key  to  keeping 
this  from  becoming  a  blind,  explosive  search  is  the  proper  use  of  evaluation 
criteria.  Each  mathematician  uses  his  own  personal  heuristics  to  choose  the  "best" 
alternative  available  at  each  moment. 

3.  Non-formal  criteria  (aesthetic  interestingness,  inductive  inference  from  empirical 

evidence,  analogy,  and  utility)  are  much  more  important  than  formal  deductive 
methods  in  developing  mathematically  worthwhile  theories,  and  in  avoiding 
barren  diversions. 

4.  Pt ogress  in  any  field  of  mathematics  demands  much  non-formal  heuristic  expertise 

in  many  different  "nearby"  mathematical  fields.  So  a  broad,  universal  core  of 
knowledge  must  be  mastered  before  any  single  theory  can  meaningfully  be 
developed. 

5.  It  IS  sufficient  (and  pragmatically  necessary)  to  have  and  use  a  large  set  of  informal 

heuristic  rules.  These  rules  direct  the  researcher’s  next  activities,  depending  on 
the  current  situation  he  is  in.  These  rules  can  be  assumed  to  superimpose 
ideally:  the  combined  effect  of  several  rules  is  just  the  sum  of  the  individual 
effects. 

6.  The  necessary  heuristic  rules  are  virtually  the  same  in  all  branches  of  mathematics, 

and  at  all  levels  of  sophistication.  Each  specialized  field  will  have  some  of  its 
own  heuristics;  those  are  normally  much  more  powerful  than  the  general-purpose 
heuristics. 

7.  For  true  understanding,  the  researcher  should  grasp^®  each  concept  in  several  ways: 

declaratively,  abstractly,  operationally,  knowing  when  it  is  relevant,  and  as  a 
bunch  of  examples. 

8.  Common  metaphysical  assumptions  about  nature  and  science:  Nature  is  fair, 

uniform,  and  regular.  Coincidences  have  meaning.  Statistical  considerations  are 
valid  when  looking  at  mathematical  data.  Simplicity  and  symmetry  and  synergy 
are  the  rule,  not  the  exception. 
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7.2.7.  Ultimate  powers  and  weaknesses 

Consider  now  an^  system  which  is  consistent  with  the  preceding  model  of  math  research, 
and  whose  orientation  is  to  discover  and  develop  new  (to  the  system)  mathematical  theories. 
This  includes  AM  itself,  but  might  also  include  a  bright  high-school  senior  who  has  been 
taught  a  large  body  of  heuristic  rules. 

What  can  such  systems  ultimately  achieve?  What  are  their  ultimate  limits?  Answers  to 
ultimate  questions  are  hard  to  come  by  experimentally,  so  this  discussion  will  be  quite 
philosophical,  speculative,  and  short.  The  model  of  math  research  hinges  around  the  use  of 
heuristic  rules  for  guidance  at  all  leve.’s  of  behavior.  It  is  questionable  whether  or  not  all 
known  mathematics  could  evolve  smovithly  in  this  way.  As  a  first  order  fixup,  we've 
mentioned  the  r.eed  to  provide  good  meta-heuristics,  to  keep  enlarging  the  set  of  heuristics. 
If  this  is  not  enough  (if  meta-meta-...-ieuristics  are  needed),  then  the  model  is  a  poor  one 
and  has  some  inherent  limitations.^^  If  some  discoveries  can  only  te  made  non-rationally 
(by  random  chance,  by  Gestalt,  etc.)  then  any  such  system  would  be  incapable  of  finding 
those  concepts. 

Turning  aside  from  math,  what  about  systems  whose  design  -  as  a  computer  program  -  is 
similar  to  AM?'’*  Building  such  systems  will  be  ’fun",  and  perhaps  will  result  in  new 
discoveries  in  other  fields.  Eventually,  scientists  (at  least  in  a  few  very  hard  domains)  may 
relegate  more  and  more  of  their  "hack"  research  duties  to  AM-like  systems.  The  ultimate 
limitations  will  be  those  arising  from  Incorrect  (e.g.,  partial)  models  of  the  activities  the 
system  must  perform.  The  systems  themselves  may  help  improve  these  models:  experiments 
that  are  performed  on  the  systems  are  actually  tests  df  the  underlying  model;  the  results 
might  cause  revisions  to  be  made  in  the  model,  then  in  the  system,  and  the  whole  cycle 
would  begin  again. 


7.3,  Final  Conclusions 

Before  quitting,  let’s  summarize  what’s  worth  remembering  about  this  thesis. 

•  It  is  a  demonstration  that  a  few  hundred  general  heuristic  rules  suffice  to  guide  an 
automated  math  researcher  as  it  explores  and  expands  a  large  but  incomplete 
knowledge  base  of  math  concepts.  .AM  serves  as  ?  living  existence  proof  that  creative 
research  can  be  effectively  modelled  as  heuristic  search. 
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If  Ptolemy  had  had  accata  to  a  digital  computer,  all  hit  data  could  havt  bean  madt  to  fit  (to  any  detirtd  accuracy),  just 
by  computing  api-cyclat,  tpi-api-cyclat,_  to  the  ntadad  number  of  api't  Wa  in  Al  mutt  conttantly  ba  on 
guard  againtt  that  error. 
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Having  an  agenda  of  tatkt  with  reatont  and  reaton-ratinge  combining  to  form  a  global  priority  for  each  tatk,  having 
unitt/modulet/framet/Beinge/Actore/concapte  which  have  parte/elote/facete,  ate.  Heurietic  rulet  era 
tackad  onto  relevant  concapte,  and  are  aaacutad  to  produce  new  concepte,  new  taeke,  new  facet  antriet. 
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•  The  thesis  also  ititroduces  a  control  structure  based  upon  an  agenda  of  small  research 

tasks,  each  with  a  list  of  supporting  reasons  attached. 

•  The  main  limitation  of  AM  was  its  inability  to  synthesize  powerful  new  heuristics  for 

the  new  concepts  it  defined. 

•  The  main  successes  were  the  few  novel  ideas  it  came  up  with,  the  ease  with  which  a  new 

task  domain  was  fed  to  the  system,  and  -  most  importantly  -  the  overall  rational 
sequences  of  behavior  AM  exhibited. 

•  The  greatest  long-range  importance  of  AM  may  well  he  in  the  body  of  heuristics 

assembled  (Appendix  3),  either  as  the  seed  for  a  huge  base  of  experts’  heuristics,  or  as 
a  new  orientation  for  mathematics  education. 
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Appendix  1.  Glossary  of  Technical  Terms 


The  "jargon"  of  a  field  facilitates  communication  among  practitioners  of  that  field,  but  it  too 
often  excludes  novices.  I  have  tried  to  soften  the  impact  of  each  "buzz-word"  when  it  was 
first  used,  but  the  reader  may  need  to  frequently  refresh  his  memory  about  the  meanings  of 
certain  terms. 

This  glossary  is  divided  into  two  sections.  The  first  contains  primarily  Mathematics  terms, 
strangely  biassed  because  it  Just  covers  what  is  referenced  in  this  thesis.  The  second 
glossary,  of  Computer  Science  and  Artificial  Intelligence  terms,  suffers  from  the  same  tunnel 
vision.  They  may  suffice  for  reading  this  document,  but  they  are  certainly  not  meant  to  be 
used  for  more  general  purposes. 

Apoendix  1.1.  Glossarv  of  Math  Terms 

Abduction:  In  logic,  a  syllogism  of  the  form  "from  A,  conclude  that  B  is  probably  true".  If 
your  mental  frame  for  an  automobile  contains  a  hundred  necessary  features,  and  you  see 
something  satisfying  only  90  of  them,  you  can  abductively  conclude  it  is  probably  an 
automobile. 

Cardinality:  the  concept  of  "number".  Two  sets  are  of  the  same  cardinality  iff  they  have  the 
same  number  of  elements. 

Composition  of  two  relations  R  and  S:  This  is  a  new  relation  denoted  RoS,  and  defined  as 
RoS(x)  «  R(S(x)).  So  RoS  maps  elements  of  the  domain  of  S  into  elements  of  the  range  of 
R.  Notice  that  if  R  and  S  are  both  functions,  then  so  is  RoS.  The  intuitive  picture  of  this 
process  is  to  operate  on  x  with  the  relation  S,  and  then  apply  R  to  the  results. 

Function:  an  operation  f  which  associates,  to  each  clement  x  of  some  set  D,  an  element  f(x) 
of  some  set  R.  D  and  R  are  the  domain  and  range  of  f.  Notice  that  a  function  may  be 
considered  a  special  kind  of  relation.  For  a  relation  f  (on  DxR)  to  be  called  a  function,  f 
must  satisfy  two  important  constraints:  (i)  it  must  be  always-defined  on  its  domain;  that  is, 
for  all  domain  elements  x<D,  f(x)  must  exist,  (ii)  f  must  be  single-valued;  that  is,  f(x)  must 
be  a  singleton. 

Iff:  if  and  only  if;  Implies  and  is  implied  by;  is  equivalent  to;  <=>. 

Integers:  positive  and  negative  whole  numbers;  i.e . -2,  -1,  0,  1,  2,... 

Map:  used  as  a  verb,  this  word  indicates  the  action  of  applying  a  function  or  a  relation;  e.g., 
we  say  that  squaring  maps  7  into  49.  Used  as  a  noun,  it  is  a  synonym  for  function. 

Mathematical  concept:  this  is  taken  to  mean  all  the  constructipns,  definitions,  conjectures, 
operations,  structures,  etc.  that  a  mathematician  deals  with.  Some  examples:  Set-intersection, 
Sets,  The  unique  factorization  theorem,  every  entry  listed  in  this  glossary. 
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Mathematical  intuition:  this  is  the  mental  imagery  which  can  be  brought  to  bear.  Typically, 
we  transform  the  situation  to  an  abstract,  simplified  one,  manipulate  it  there,  and  re¬ 
translate  the  results  into  the  original  notation.  For  example,  our  intuition  about  "ordering" 
may  involve  the  image  of  marks  on  a  yardstick.  We  can  then  answer  questions  involving 
ordering  rapidly,  using  this  representation.  Three  features  of  the  intuitive  image  should  be 
noted:  (i)  it  is  typically  fast  and  simple,  (ii)  it  is  opaque,  one  cannot  introspect  too  easily  on 
"why  it  works",  and  (in)  it  is  fallible,  occasionally  leading  to  wrong  results. 

Mathematical  research:  The  fundamental  idea  here  is  that  mathematics  is  an  empirical 
science,  just  as  much  as  chemistry  or  physics.  In  doing  research,  the  ultimate  goal  is  the 
creation  of  new,  interesting  theories,  but  the  techniques  used  include  looking  for  patterns  in 
empirica!  data,  inducing  new  conjectures,  modelling  some  aspects  of  the  real  world,  etc. 
Although  the  final  product  looks  like  a  smooth,  formal  development,  magically  flowing  from 
postulates  to  lemmas  to  theorems,  the  actual  research  process  involved  untold  blind  alleys, 
rough  guesses,  and  hard  work.  (Analogy:  The  process  of  painting  is  rarely  itself  artistic.) 

Mathematical  theory:  to  qualify  as  a  theory,  we  must  have  (i)  a  basis  of  undefined  primitive 
terms,  (ii)  definitions  involving  these,  (lii)  axioms  involving  all  the  primitives  and  defined 
terms  (iv)  conjectures  and  theorems  relating  these  terms.  To  be  at  all  worthwhile,  however, 
the  theory  must  also  meet  the  fuzzy  requirements  that  (v)  there  is  some  correspondence 
between  the  primitives  and  some  "real-world"  concepts,  between  the  axioms  and  some  "real" 
relationships,  and  (vi)  some  of  the  theorems  are  unexpected,  hard  to  prove,  elegant, 
interesting,  etc. 

Mersenne  prime:  a  prime  number  which  happens  to  be  of  the  form  2P-1,  where  p  is  prime. 
Natural  numbers:  non-negative  integers;  i.e.,  0,  i,  2,  3,... 

No.:  an  abbreviation  for  "Number". 

Number:  in  the  typical  loose  fashion  of  computer  scientists,  I  intend  this  to  mean  a  non¬ 
negative  integer:  i.e.,  a  natural  number. 

Ordering;  the  concept  of  "before"  and  "after".  This  distinguishes  a  list  from  a  bag 
(multiset!  The  formal  axioms  for  ordering  simply  state  the  obvious  properties  of  the 
intuitive  image  of  a  list. 

Prime  numbers:  natural  numbers  which  have  no  divisors  other  than  1  and  themself;  e.g.,  17, 
but  not  16  (=3x5).  Primes  are  interesting  because  of  the  myriad  times  they  crop  up  in 
diverse  theorems  -  from  the  Chinese  Remainder  Theorem  (solving  systems  of  linear 
congruence  equations),  to  the  Law  of  Quadratic  Reciprocity,  to  Fermat’s  Theorem  (for  all 

integers  n,  for  all  primes  p,  nP  is  congruent  to  n  (mod  p)).  The  "secret"  of  their  value  lies  in 
the  fact  that  all  integers  can  be  factored  uniquely  into  a  set  of  prime  divisors.  This  "Unique 
Factorization  Theorem"  lets  us  reduce  questions  about  integers  to  questions  about  primes. 

Prime  pairs:  two  prime  numbers  whose  difference  is  two;  e.g.,  17  and  19. 

Relation:  an  operation  which  associates,  for  each  element  of  some  set  D,  a  set  of  elements  E 
-  {ej,  e2,...}  of  some  set  R.  D  and  R  are  the  domain  and  range  of  the  relation.  For  example. 
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the  relation  "<"  associates  to  5  the  set  of  numbers  {5,  6,  7,  8,...}  -  i.e.,  all  integers  which  5  is 
less  than  or  equal  to.  The  domain  and  range  of  this  relation  are  the  integers. 

Set-theoretic:  having  to  do  (in  the  context  of  this  thesis)  with  elementary  finite  set  theory, 
and  the  primitive  notions  of  mathematics  (e.g.,  union,  insert,  predicate,  conjecture). 

Unity:  a  fancy  way  of  referring  to  the  natural  number  "I". 

|:  The  relation  "divides-evenly-into".  Thus  we  say  2|6. 

The  operation  of  negation.  "■•X"  is  read  as  "not  X". 

V:  Disjunction.  "AvB"  is  read  as  "A  or  E". 

A:  Conjunction.  "AaB"  is  read  as  "A  and  B". 

®:  Exclusive  or.  "A*B"  is  read  as  "A  or  B,  but  not  both". 

Implication.  "A-»B"  is  read  as  "If  A  then  B". 

Logical  equivalence.  "A«B"  is  read  as  "A  if  and  only  if  B". 

V:  Universal  quantification.  "VX"  is  read  as  "For  all  X". 

3;  Existential  quantification.  "3X"  is  read  as  "For  some  X". 
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Aopendix  1.2.  Glossarv  of  AI  Terms 

ACTORs:  A  modular  form  of  representatron,  useful  for  distributing  of  the  task  of  control 
among  several  components  in  a  computer  program.  Each  ACTOR  is  a  black  box,  with  no 
parts  or  slots,  but  which  does  have  some  assertions  (a  "contract")  which  he  must  honor.  It 
merely  responds  to  a  fixed  set  of  messages,  by  sending  out  certain  messages  of  his  own. 
These  are  delive,-ed  via  a  bureaucracy.  See  [Hewitt  76]. 

Ai:  an  abbreviation  for  Artificial  Intelligence. 

Bag:  A  bag  is  a  kind  of  list  structure,  a  bunch  of  elements  which  are  unordered,  but  one  in 
which  multiple  copies  of  the  same  element  are  permitted.  One  may  visualize  a  paper  bag 
filled  with  cardboard  letters.  Technically,  we  shall  say  that  a  set  is  not  considered  to  be  a 
bag.  A  bag  is  denoted  by  enclosure  within  parentheses.  Just  as  sets  are  within  braces.  So 
the  bag  containing  X  and  four  Y’s  might  be  written  (X  Y  Y  Y  Y),  and  would  be  considered 
indistinguishable  from  the  bag  (Y  Y  Y  X  Y). 

BEINGs;  A  modular  form  of  representation  of  knowledge,  conceived  as  a  collection  of 
cooperating  experts.  Each  expert  is  modelled  by  one  module,  which  consists  of  a  list  of 
Question/ Answering-program  pairs.  The  set  of  questions  is  fixed  for  all  the  Beings  in  the 
system.  When  any  Being  has  a  question,  he  broadcasts  it  to  the  entire  system,  and  some 
Being  who  recognizes  it  will  take  over  control  and  try  to  answer  it  by  running  his 
appropriate  Answering-program.  In  the  process  of  running  this,  some  new  questions  may 
arise.  Notice  that  Beings  distribute  responsibility  for  control  and  for  static  knowledge.  See 
[Lenat  75b]. 

Bug:  a  flaw  in  a  computer  program.  As  Corey  Sacerdoti  put  it,  a  bug  refers  to  something 
which  is  broken  but  not  badly. 

Concept:  within  the  context  of  this  document,  the  word  "concept"  typically  refers  to  a  precise 
frame-like  data  structure,  a  BEING.  Semantically,  each  concept  is  meant  to  correspond  to 
one  abstract  entity  that  we  would  Intuitively  call  a  concept:  an  object,  an  operator,  a 
conjecture,  etc.  See  "facet". 

Cooperating  Knowledge  Sources:  Very  often,  in  tackling  a  problem,  one  receives  some  hints 
and  some  constraints  from  very  different  sources,  phrased  in  very  different  languages,  often 
addressing  different  representations  of  the  problem.  For  example,  in  trying  understand  a 
human  speaker,  our  memory  of  the  previous  discussion  and  knowledge  of  the  speaker  may 
narrow  down  the  possible  meanings  of  what  he  is  saying.  Our  ears,  of  course,  register  the 
precise  acoustic  wave-forms  he  is  uttering.  Our  English  vocabulary  forces  us  to  interpret 
imperfect  signals  as  real  words.  Our  eyes  see  his  gestures  and  hi-  lip  movements,  and  give 
us  more  information.  All  these  different  sources  of  information  must  be  used,  and  yet  they 
all  are  talking  In  different  "languages"  to  us.  The  most  trivial  solution  is  to  keep  all  the 
sources  independent,  and  keep  working  until  one  of  them  can  solve  the  problem  all  by  itself. 
A  much  better  solution  is  to  transform  ail  their  babblings  into  one  canonical  representation, 
one  single  language.  This  way,  all  the  knowledge  sources  can  cooperate. 

Coupled:  two  functional  subsystems  are  causally  connected;  one  influences  the  other.  See  the 
entry  for  "Linear". 
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CPU  time:  Central-Processing-Unit  runtime  (cpu  time)  is  the  number  of  execution  cycles  of 
the  computer  that  the  AM  program  has  used  up.  This  is  conveniently  measured  in  seconds, 
minutes,  and  hours,  where  one  cpu  minute  is  the  amount  of  processing  done  in  one  minute 
of  real  time,  when  AM  has  lOOX  of  the  machine,  and  is  runninng  without  any  input  or 
output. 

CS:  an  abbreviation  for  Computer  Science. 

Execution:  a  program  is  actually  used  by  running  it  on  a  particular  set  of  input  data.  This 
process  is  known  as  program  execution. 

Facet:  Within  the  context  of  this  document,  the  word  "facet"  denotes  a  slot  of  the  kind  of 
data-structure  known  as  "concepts"  (qv).  Thus  "u  facet  of  the  Compose  concept"  really  just 
means  a  slot  of  a  particular  frame,  a  part  of  certain  BEING,  one  single  attribute/value  pair 
taken  from  the  property  list  of  the  Lisp  atom  named  Compose.  Semantically,  each  facet 
holds  information  pertaining  to  a  single  aspect  of  the  concept  it  is  a  part  of;  hence  the 
suggestive  name:  "facet". 

FRAMES:  A  modular  representation  of  knowledge.  Each  module  is  a  list  of  Feature/ Value 
pairs.  The  value  represents  a  default  assumption  which  can  be  relied  on  until/unless  new 
information  comes  in  about  that  feature.  Each  frame  has  whatever /eatures  (called  "slots") 
seem  appropriate.  Whenever  a  situation  S  is  encountered,  the  frame{s)  for  S  are  activated. 
As  new  information  lolls  in,  it  replaces  the  default  information  in  various  slots.  Notice  the 
emphasis  on  distributing  static  knowledge  {data),  not  necessarily  control,  in  such  a  system. 
See  [Piaget  55]  or  [Minsky  75]. 

Function:  a  small,  executable  part  of  a  program.  When  fed  the  proper  kind  of  argument(s), 
a  function  will  "run"  and  ultimately  produce  some  sort  of  value.  Unlike  pure  mathematical 
functions  (see  the  previous  glossary),  a  Lisp  function  can  have  side  effects  (qv). 

Garbage  collection:  As  a  Lisp  program  executes,  various  list  structures  (pointer  networks) 
are  created.  When  the  last  pointer  to  a  structure  is  removed,  that  structure  has  essentially 
been  irretrievably  forgotten.  If  the  operating  system  knew  which  storage  cells  were  thus 
"free",  it  could  re-cycle  them,  reuse  them.  The  process  of  finding  and  liberating  such 
discarded  lists  is  called  garbage  collection.  This  Is  performed  automatically  by  the  Lisp 
language,  whenever  space  is  almost  all  filled  up. 

Hack:  A  quick  Job  that  produces  what  is  needed,  but  not  well.  Introducing  a  heuristic  which 
was  only  used  once,  in  a  predetermined  way  (e.g.,  to  fix  a  particular  bug),  would  be  a  real 
hack. 

Hand-crafting:  the  human  programmer  carefully  designs  his  system  in  such  a  way  that  the 
pieces  just  manage  to  mesh.  For  instance:  he  provides  just  the  perfect  set  of  axioms  so  that 
his  theorem-prover  can  solve  a  certain  problem,  or  he  modifies  the  program’s  strategies  so 
that  they  efficiently  manipulate  the  axiom  set  in  just  the  right  way. 

Heterarchy:  A  kind  of  control  structure  for  a  computer  program  which  is  distinct  from 
hierearchy.  Heterarchical  structuring  views  the  whole  program  as  a  collection  of  equal 
partners,  an  unstructured  set  of  functions.  "Control"  is  viewed  as  a  spotlight,  which  can  be 
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flicked  from  one  function  to  another.  The  functions  can  affect  who  does  or  doesn’t  get 
control  next,  but  there  is  no  guarantee  who  will  get  control,  or  that  control  will  revert  back 
to  some  function  which  once  had  it.  Aside  from  the  lure  of  its  democratic  flavor,  it  is 
clearly  a  natural  way  to  represent  cooperating  knowledge  modules. 

Hierarchy:  This  term  refers  to  a  kind  of  control  structure  for  a  computer  program.  The 
typical  hierarchical  structure  is  one  in  which  a  function  calls  a  subroutine,  which  processes 
and  then  returns  a  value  to  that  function.  A  program  is  viewed  as  a  tree  structure,  with 
lines  indicating  "calling". 

Interact:  a  dynamic  mode  of  communication  between  a  human  and  a  computer  program. 
The  human  reacts  to  what  the  program  is  printing  out  on  his  terminal,  and  the  program  in 
turn  reacts  to  what  the  user  types  in.  This  may  take  the  form  of  questioning  and  answering, 
or  interrupting  and  commenting. 

Interestingness:  Note  that  this  is  not  a  valid  English  word.  In  the  context  of  AM,  it  refers  to 
a  numeric  value,  computed  by  little  Lisp  programs  stored  in  the  "Interest"  facets  of  various 
concepts.  Despite  the  danger  of  imbuing  such  a  humble  scheme  with  all  the  mystique  of 
what  is  and  isn’t  interesting,  it  is  felt  that  a  sufficient  component  of  that  evaluation  has  been 
captured  to  warrant  the  name.  Pragmatically,  it  is  of  much  more  use  to  the  user  to  see 
"Interestingness  of  Compose  has  just  risen"  than  to  see  a  message  like  "G00034 
incremented". 

Kludge  (or  Kluge):  This  is  a  program  feature  which  is  an  unfair  shortcut  around  a  specific 
problem.  One  "kludgy"  way  of  improving  the  algorithm  of  a  given  concept  is  to  ask  the  user 
for  a  better  algorithm. 

Linear:  a  system  whose  components,  inputs,  and  outputs  superimpose  -  i.e.,  don’t  couple. 

Lisp:  a  LISt-Processing  programming  language.  Primitive  operations  exist  for  manipulating 
nested  list  structures.  Since  Lisp  functions  are  also  merely  lists,  it  is  easy  to  create  and 
modify  entities  which  are  then  executed  (qv). 

Modular  Representations  of  Knowledge  in  AI  Systems:  Knowledge  is  partitioned  into 
packets  (called  modules,  frames,  units,  productions,  Beings,  experts.  Actors)  along  lines  of: 
different  applicabilities,  expertise,  purpose,  importance,  generality,  etc.  Each  packet  is 
structurally  similar  to  all  the  rest.  Advantages:  By  having  the  knowledge  discretized,  pieces 
can  be  added  and/or  removed  with  no  trouble.  The  knowledge  of  the  system  is  easily 
inspected  and  analyzed.  The  structural  similarity  yields  several  advantages:  a  simple  control 
system  suffices  to  "run"  all  the  knowledge,  the  modules  can  intercommunicate  easily,  new 
modules  can  be  inserted  without  knowing  precisely  "who  else"  is  already  in  the  system.  In 
general,  the  less  similarly-structured  the  modules  are,  the  simpler  the  inter-communication 
media  must  be.  Modular  representation  is  a  natural  way  to  implement  cooperating 
knowledge  sources. 

Number:  in  the  typical  loose  fashion  of  computer  scientists,  I  intend  this  to  mean  a  non¬ 
negative  integer:  i.e.,  a  natural  number. 

Open  research  problem:  a  limitation  of  the  AM  system. 


Appendix  I 


AM  Ciicovery  in  Methematic*  at  Heuriitic  Search 


-171- 


Recur:  Often,  part  of  a  definition  will  refer  back  to  that  very  same  definition.  This  may 
lead  to  an  infinite  circular  loop,  or  it  may  terminate.  The  following  definition  of  "is  larger 
than"  is  recursive,  because  the  last  line  recurs: 


s«t  R  is  larger  than  set  S 
if  R={}  bul  S/{},  or 
if  neither  is  empty  and 

Remove'elementCR)  is  larger  than  Remove>element(S). 


Recurse;  a  transitive  verb  which  means  "to  swear  again."  It  must  be  distinguished  from 
"recur",  above. 

Side  effects:  while  a  function  is  executing,  it  may  cause  changes  in  the  state  of  its 
environii.ent  which  persist  even  after  the  function  has  returned  a  value.  This  is  like 
hysteresis  effects.  For  example,  a  function  may  create  or  destroy  some  list  structure,  define  a 
new  function,  reset  some  variable,  etc.  Such  activities  are  called  side  effects  of  the  function. 

Space:  The  memory  of  a  computer  is  quite  finite.  Though  it  may  be  supplemented  by  slow 
auxilliary  devices  (tapes,  discs,  etc.),  the  actual  number  of  storage  cells  in  the  computer’s  fast 
"core"  memory  is  a  limiting  factor  in  program  behavior.  Storage  space,  or  just  "space",  refers 
to  these  internal  memory  cells.  When  space  is  exhausted,  the  only  remedy  is  to  perform  a 
garbage  collection  (qv). 

System:  this  can  mean  a  computer  program,  and  occasionally  is  just  an  another  way  of 
referring  to  AM.  In  general,  a  system  is  any  collection  of  entities  related  to  form  a 
meaningful  whole. 

Terminal:  a  communications  device  for  passing  information  between  a  computer  system  and 
a  human.  This  could  be  a  teletype,  a  TV  screen  and  keyboard,  etc.  The  terminal  is  usually 
portable  and  remotely  located  from  the  computer. 

User:  the  human  being  who  sits  at  a  computer  terminal  and  watches  AM  run  (occasionally, 
perhaps,  interacting  with  AM). 
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Appendi:.'  2.  AM’s  Concepts 


The  t.rst  part  of  this  huge  appendix  (Appendix  2.1.2  to  2.1.75)  lists  the  set  of  knowledge 
AM  started  with:  its  initial  concepts.  It  is  not  very  readable,  nor  is  it  central  to  any  of  the 
ideas  on  which  AM  is  based.  The  reader  is  therefore  warned  to  proceed  at  his  own  risk 
through  this  material. 

Section  2  of  this  appendix  contains  a  brief  description  of  tho.se  concepts  which  were  only 
partially  implemented  in  AM  (e.g.,  "Destructive-op").  It  was  decided  not  to  give  each  of 
them  a  full  "box"  of  their  own. 

The  third  part  of  this  appendix  lists  a  couple  concepts  as  they  were  actually  coded  into 
Lisp.  The  reader  is  shown  which  entry  -  or  heuristic  rule  -  each  bit  of  Lisp  code 
corresponds  to. 

Finally,  starting  on  page  224,  a  list  is  provided  of  some  of  the  concepts  which  AM  created. 
This  is  intended  not  as  an  exhaustive  catalog,  but  merely  to  show  the  breadth  of  what  was 
done  by  AM,  the  smart  guesses  and  the  lunacies.  This  list  could  have  been  pieced  together 
by  '.ludying  Appendix  5,  vr.erein  some  examples  of  AM  in  action  are  given.  There  the 
reader  may  dynamically  observe  what  kinds  of  concepts  -  and  infer  what  kinds  of  entries 
for  their  facets  -  AM  was  able  to  derive  from  its  initial  base. 

Apoeiidix  2.1.  Initial  Concepts 

Each  concept  will  be  listed,  followed  by  a  description  of  the  entries  in  each  of  its  facets*. 
For  each  such  "slot",  a  condensation  is  provided  (in  English,  LI.^P,  and  math  notation)  of 
all  the  knowledge  initially  supplied  to  AM  about  that  facet  of  that  concept. 

If  there  is  any  unmentioned  facet  for  a  concept,  then  it  started  out  blank.  Many  of  the 
facets  01  ihp  original  concepts  were  left  blank  intentionally,  knowing  that  AM  would  be  able 
to  fill  them  in  as  well.  After  all,  if  you  can  fill  in  examples  of  any  new  concept,  you  ought  to 
be  able  to  fill  in  examples  of  Sets! 

The  concepts  are  grouped  semantically,  much  like  the  tree  shown  on  page  105,  like  the 
order  in  which  heuristics  are  listed  in  Appendix  3.  This  section  of  the  appendix  is 
prefaced  by  an  index  which  is  arranged  alphabetically,  since  the  primary  use  of  it  will 
probably  be  as  an  encyclopedia.  When  the  reader  encounters  a  poorly-nampd  or  poorly- 
explained  concept  somewhere  in  the  text,  he  may  wish  to  glance  first  at  Chapter  5,  page  107, 
where  very  brief  definitions  of  the  concepts  are  also  given  alphabetically.  If  that 
"dictionary"  is  ins'ifficent,  he  can  turn  to  the  appropriate  page  in  this  appendix,  and  see  the 
same  concept  presented  in  much  more  detail. 


I 


Each  of  Ihttt  on'.-itt  was  auppliad  by  hand,  by  Iht  auihor. 
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Appendix  2.1.1  Index  to  Initial  Concepts 


CONCEPT 


PAGE  CONCEPT  PAGE 


A(  five .  175 

A  I!  but-the-first-element  . 201 

All-but-the-last-element ....  202 

Any-concept .  174 

Anything . 17-2 

A  tom-obj .  -8 

Bag-Delete .  *<4 

Bag-DilT . 

B.ig-in.'-ert .  182 

Brfg-lnteisect .  189 

Bag-Union .  191 

Bags . 212 

Canonii'p  196 

Coalesce  .  195 

Coiiipo.se . 178 

Coiijectuie . 207 

Constant-False . 177 

Constant-piedicate .  176 

Constant-True . 176 

Delete .  183 

Difference .  192 

Empty-structure . 211 

First-element . 201 

Identity . 204 

Insert .  179 

Intersect .  186 

Invert-an-operation . 205 

Inverted-op . 205 

Last-element . 200 

List-Delete .  184 

List-Diff .  192 

List-insert . 182 

List-Inteisect . 186 

Ltst-Union . 190 

Lists . 213 

Logical-combination . 206 

Member . 202 


Multiple-elements-structure .  210 

No-rnultip!e-elements-structure . 211 

Nonempty-strucnjre . 21 1 

Objeci .  207 

Object-equality .  176 

Operation .  177 

Ord-Structure . 210 

Ordered-pairs . 213 

Oset-Deletc .  185 

Oset-Diff .  193 

Oset-insert .  181 

Oset-lntersect .  187 

Oset-Union .  190 

Osets .  214 

Parallel-Join .  199 

Parallel- join2 .  199 

Parallel-replace .  197 

Parallel-replace2 .  197 

Predicate .  175 

Projection! .  203 

Projection2 .  203 

Relation . 206 

Repeat .  198 

Repeat2 .  198 

Restrict . 204 

Reverse-ord-pair .  200 

Set-Delete .  183 

Set-Diff .  193 

Set-insert .  180 

Set-lntersect .  188 

Set  Union .  191 

Sets . 212 

Structure .  209 

Structure-of-Structures . 209 

Truth-value . 208 

Union .  189 

Unord-Structure . 210 
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Apoeiidix  2.1.2  Anything 

Name(s):  Anything,  Entity,  Thing,  Item 
Definitions: 

Non-Recursive,  Trivial,  Quick:  X  ()  T 
Specializations:  Any-coneept,  Non-concepts 
Generalizations:  none 
Examples:  Anything,  Any-concept 
Isa's:  Any-concept 
Worth:  100 

Interest:  5  heuristics  (see  Appendix  3.1,  page  229).^ 

Sugg:  5  heuristics 

In-domain-of:  Delete,  Insert^,  Member,  Projl,  Proj2,  Identity,  Constant-pred. 
In-range-of:  First-ele,  Last-ele,  Member,  Projl,  Proj2,  Identity. 


Appendix  2.1.3  Anv-concept 

Name(s):  Any-concept,  Any-Being,  Anybody 
Definitions: 

Non-Recursive,  Opaque,  Quick:  X  (x)  FMEMB(x,Concepts} 
Non-Recursive,  Opaque,  Quick:  X  (x)  GETP(x,Name) 

Specializations:  Active,  Object 

Generalizations:  Anything 

Examples:  Anything,  Any-concept,  Active,  Object 

Isa's:  Anything,  Any-concept 

Worth:  100 

View:  to  view  any  X  as  if  it  were  a  Y,  find  an  op.  whose  domain  contains^  X, 
and  whose  range  is  contained  in  Y,  and  apply  that  op.  to  the  given  X. 
Fillin:  39  heuristics  (see  Appendix  3.2,  beginning  on  page  230).^ 

Check:  20  heuristics 
Interest:  2i  heuristics 
Sugg:  30  heuristics 


2 

In  gtn«r*l,  thi*  «pp«ndix  will  omit  h«uri*tict  Th«y  will  m*tt*d  b«  protontsd  m  ont  big  colloct'ion,  i«  tht  noxt  ippendix.  For 
•Jch  concept,  we  will  however  mention  how  miny  heurittic*  of  eich  variety  are  pretent.  The  intereeted 
reader  may  turn  immediately  to  Appendix  3  if  he  detiret,  to  tee  thote  '  lurietic  rulee. 

2 

All  four  epecialitatione  of  each  of  Delete  (eg.,  Bag>delete)  end  Ineert  (eg,  Liet-intert)  are  alto  litted  here. 

That  it,  the  domain  of  the  operation  it  01x02x03..,  and  X  it  a  tubtet  of  tome  Di,  a  tpecialization  of  Di 

5 

At  utuil,  the  heuri-  t  are  litted  in  Appendix  3,  not  here.  But  the  reader  ie  forewarned  that  thie  concept  hat  to  many 
heurii  .ict  that  they  are  grouped  by  facet  in  the  next  appendix,  occupying  Appendicee  3.2  I  through 
3  2  8,  ptgee  230  to  251. 
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Appendix  2.1.4  Active 


Name(s):  Activ*,  activity,  action 
Definitions: 

Sufficient,  Non-Recursive,  Quick:  X  (x)  GETP(x,Algorithms) 
Sufficient,  Non-Recursive,  Quick:  X  (x)  GETP(x,Dom/range} 
Specializations:  Predicate,  Relation,  Operation 
Generalizations:  Any-concept 
Examples:  none.® 

Isa’s:  Any-concept 

In-domain-of:  Constructive,  Destructive,  Coalesce,  Compose,  Restrict 
In-range-of:  Compose,  Coalesce,  Restrict. 

Worth:  100 
Fillin:  7  heuristics. 

Check:  4  heuristics 
Interest:  3  heuristics 
Sugg:  10  heuristics 


Appendix  2.1.5  Predicate 

Name(s):  Predicate,  sometimes:  logical  operation.  Boolean  function. 

Definitions: 

Nonrecursive  quick  opaque:  X  (P)  Range(P)  is  Truth-value;  i.e.,  {T,K}. 
Generalizations:  Active 

Examples:  Equality,  Constructive,  Destructive,  Empty,  Nonempty,  Constant-pred, 
the  Defn  entries  of  each  concept.^ 

In-domain-of:  Canonize 
Worth:  100 
Fillin:  2  heuristics. 

Sugg:  1  heuristic. 

Interest:  1  heuristic. 


6 

7 


Recall  that  each  active  will  be  an  example  of  an  operation,  predicate,  etc,  hence  need  not  be  pointed  to  explicitly  here 

Thu*  the  predicate  'Empty',  while  it  *xi*t«  in  AM,  i«  euperflout,  tine*  the  definition  facet  of  'Empty-«true'  containe  that 
very  predicate 


A  T  KT 


K 
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Appendix  2.1.6 . Object-equality 


Nam«(c):  Equality,  Object  equality,  Obj*equal,  Equal,  Same. 

Definitions: 

Nonrecursive  opaque:  X.  (x,y)  EQUAL(x,y) 

Sufficient,  very  quick,  opaque:  X  (x,y)  EQ(x,y)).' 

Recursive  slow:  X  (x,y)  x  and  y  are  both  identical  atoms, 
or  X  and  y  are  both  empty  structures, 
or  X  and  y  are  both  nonempty  structures  and 
Equality.Defn(CAR(x),CAR(y))  and 
Equality.Defn(CDR(x),CDR(y)). 

Nonrecursive  transform  slow:  X  (x  y)  ldentity.Defn(x,y). 

Quick:  X  (x,y)  y»Equality.Alts(x). 

Domain/rance:  CObject  Object  •+  {T,F}> 

<Structure  Structure  -♦  {T,F}> 

Algorithms: 

Nonrecursive  quick:  X  (x)  x. 

Conjee:  'Identity,  restricted  to  Objects,  is  the  same  as  Obj-Equslity.' 
Isa's:  Predicate 
Worth:  200 

What:  the  Equality  of  two  list  structures;  closely  related  to  Identity  op. 


Appendix  2.1.7  Constant-predicate 

Namefs):  Constant'predicate,  Const  pred.  Logical  constant  function. 
Definitions:  none. 

Domain/range:  <Anything...  Anything  -» {T,F}> 

Isa's:  Predicate 

Specializations:  Constant'True,  Constant*False 

Conjee:  (Vx,Vy)  Constant-pred.Defn(x)»Conctan*-pred.Defn(y). 

Worth:  100 

What:  a  predicate  which  always  returns  the  same  logical  value. 


Appendix  2.1.8  Constant-True 

Name(e):  Constant-True,  Constant  T,  Always-T,  sometimes:  Always. 
Definitions: 

Nonreeursive,  very  quick:  X  (...)  T. 

Domain/range:  <Anything...  Anything  -♦  {T,F}> 

<Anything...  Anything  -♦  {T}> 

Generalizations:  Constant-Predicate 
Worth:  100 

What:  a  predicate  which  always  returns  True. 


i 
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Appendix  2.1.9  Constant-False 


Name(s):  Constant-False,  Constant  F,  Always-F,  sometimes:  Never. 
Definitions: 

Nonreeursive,  very  quick:  X  (...)  F^ 

Domain/range:  <Anything...  Anything  -»  {T,F}> 

<Anything...  Anything  {F}> 

Generalizations:  Constant-Predicate 
Worth:  100 

What:  a  predicate  which  always  returns  False. 


Appendix  2.1.10  Operation 

Name(s):  Operation,  sometimes:  function,  mapping. 

Definitions:  none.^ 

Specializations:  Inverted-op,  Composition,  Canonization, 

Coalesced-op,  Constructive-cp*^ 

Generalizations:  Active 

Examples:  Insert,  Delete,  Union,  Intersect,  Uitterence,  Compose,  Canonize, 
Coalesce,  Identity,  Proji,  Proj2,  First-ele,  Last-els,  .All-but-first-ele, 
All-but-last-eie,  Restrict,  Reverse-ord-pair,  Member,  Invert,  Repeat(2}, 
Parallel-join(2},  Parallel«replsce(2). 

In-domaln-of:  Invert,  Parallel-join(2),  Parallel-replaee(2),  Repeat(2). 
In-range-of:  Canonize,  Invert,  Parallei-join(2),  Parallel-replsce(2),  Repest(2) 
Worth:  100 
Fillin:  7  heuristics. 

Check:  3  heuristics 
Interest:  11  heuristics 
Sugg:  2  heuristics 


fi 

Actuilly,  1h«  vilu«  rtturnad  it  'NIL',  not  Faitt  or  F. 

Q 

Roctll  that  all  thia  maant  ia  that  computationally,  any  entity  x  ia  conaidarad  to  ba  an  Operation  iff  it  ia  in  Operation.Exa,  or 
If  it  is  an  example  of  asms  Spacialization  of  thia  concept. 

The  concepts  of  Constructive  and  Daatructivs  operationa  are  not  encoded  as  concepts  yet.  The  distinction  between 
specialization  of  Operation  and  Example  of  operation  ia  quite  blurry.  E.f,  why  not  consider  th  class  of 
Insertion  operations  a  whole  spacialization  of  Operation,  instead  of  jut\  an  example?  The  decision  as  to  what 
status  each  operation  would  have  was  quite  arbitrary,  Pm  afraid. 
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Appcndix  2.1.11  Compose 

NamaCs):  Compost,  Composition,  somatimast  afiarwards; 

Oafiniiions: 

Daclarativa  slow:  X  (A,B,C)  Vx,  C(x)«A(B(x). 

Sufficiani  Nonraeursiva  Quick:  X  (A,3,C)  C  has  tha  Name  'AoB’. 
Sufficiant,  Slow:  Ara*aquivalant(C,Composa.Alss(A,B)). 

Suffieiant,  Quick:  C«Composa.Algs(A,B}. 

Domain/ranga:  <Activa  Acliva  -»  ActivaX 

<Operation  Activa  -♦  Operation)' * 

<Pradicata  Activa  -♦  PradicataX 
<Ralation  Ralation  -»  RalationX 

Algorithms: 

Distributed:  use  tha  hauristics  attachad  to  Composa  to  guida  tha  filling 
in  of  various  facais  of  tha  naw  composition. 

Genaralizations:  Oparation 
Isa’s:  Operation 
Worth:  300 
Fiilin:  9  heuristics. 

Check:  2  heuristic. 

Suggest:  2  hauristics. 

Interest:  11  hauristics. 


' '  Note  Ihit  while  Ihit  entry  would  imply  th«t  Operttion.ln-r«n-of  end  Operition.ln-dom-of  could  both  conttin  'Compote'  «e 
«n  entry,  only  the  meet  fenerel  concept  (i.e.,  'Active'/  .Mt  'Compote'  in  itt  In-dem-of  end  In-ren-of  facett. 

12 

An  atforithin  for  COMPOSE  it  a  proetdure  for  takinf  a  pair  of  optrationt,  i.e.  a  pair  of  conceptt  G  and  H,  and  creating  a 
new  Ktive  concept  F  which  it  defined  to  be  their  competition,  whote  Algorithme  facet  centaine  'X  (x) 
G(H(x))',  or,  more  preciaely,  '(APPLYB  G  ALGS  (APPLYB  H  ALGS  x))'. 


VFn^V  Vt’V^.%'l’^nN 
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AppendtA  2.1.12  Insert 


Nam«(s):  Ineart,  Insartion,  somatimac:  AckI,  Mar|a$ 

Dafinitions: 

Quasi-racursiva  casas:  \  (x,A|B)  [daiarmina  tha  typa  of  structura  that  A  and 
B  ara,  say  S,  than  usa  $~insart.Defn(x,A,B)]. 

Necessary,  Nonracursiva,  Quick:  X  (x,A,B)  Membar.Dafn(x,B). 

Necessary  Oeclarativa:  X  {x,A,B)  z(B  iff  zf  A  or  z«x. 

Necesary,  Oeclarativa:  X  (x,A,B)  [(Va<A)(a<B),  and  (Vbjfx  <B)(b<A},  and  x(B] 
Sufficient,  Quick:  B«lnsart.Alts(x,A). 

Domain/ranfa:  <Anythin{,  Structures  -»  Structurae> 

Algorithms: 

Quasi'racursiva  eases:  X  (x,A)  [determine  tha  typa  of  structure  A  is, 
say  S,  than  usa  S'lnsartAIgsiXiA)]. 

Isa's:  Operation 

Specializations:  Bag-insert,  Set-insert,  List-insert,  Osat-insart. 

Worth:  100 
Check:  1  heuristic. 


v*  ’.-v  \.-v  \,-wT  -s.  T  vt  ■V’,  «r  V!  »'.'i  ■^.  T  •  .'i  ■<'^‘ »'  ;T~r»" j  tt: 
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Appendix  2.1.13  Set-insert 


Nam«(t):  S«t-ins«rt,  Sat  Intartion,  tomalimae:  Intart,  Tag. 

Dafinitiontt 

Daclarativa  Slow:  X  (x,A,8)  [(Va<A)(a<B),  and  (Vb/x  <B)(b<A),  and  x<B] 
Racurtiva  Slow:  X  (x,A,B)  (A>{}  and  B*{x},  or  alta: 

[AND:  2«-Mambar.Alf(A}i  Mombar,Dafn(z,8)$ 

Sat*insort.Da(n(x,Sat>dalata.Alg(z,A),Sat*dalata.Alg(z,B))  ]) 

Racurtiva:  X  (x,A,B)  (A>{}  and  B«{x},  or  alta: 

[AND:  z^CAR(A);  Mambar.Dafn(z,B); 

Sat-intart.Da{n(x,CDR(A),Sat-dalata.Alg(z,B))  ]) 

Daclarativa:  X  (x,A,B)  (Vz)  z<B  iff  z^A  •  z«x. 

Quick:  B*Sat>intart.Aigt(x,A). 

Domain/ranga:  <Anything,  Satt  Satc> 

Algorithmt:’® 

Non-racurtiva  quick:  X  (x,A)  (if  Mambar.Dafn(x,A)  than  A,  alta  MERGE(x,A)}) 
Non-racurtive  quick:  X  (x,A)  (MERGE(x,A)  and  Elim'adjacant-mult*’alamantc(A)) 
Racurtiva:  X  (x,A)  (if  A>{}  than  {x},  alta  if  A«{x}  than  A,  alta 
[z«-CAR(A);  if  z«x  than  A,  alta  CONS(z,Sat'*intart.Alg(x,CDR(A)))]). 
Ganaralizationt:  Insart 
Worth:  100 

What:’^  If  X  itn't  alraady  in  A,  than  add  it  and  raMort  tha  lat  A. 


13 

Actually,  thla  operation,  like  all  the  other  atruetural  oparationa,  ara  much  mora  aophlatkatad  than  thia  aimpla  praaantation 
impliaa  In  thia  caaa,  if  A  ia  not  tuppliad,  AM  chootaa  a  random  axampla  of  a  Sat  and  inaarta  x  into  that  aat. 
if  X  ia  miaaini,  than  AM  finda  a  random  axampla  of  Anythin!  and  inaarta  H  into  A. 

14 

Tha  'What'  faeat  doaan’t  really  axiat,  but  ia  occaaionally  praaant  in  thia  Appendix  for  tha  aid  of  tha  reader.  A  fuller 
Enfliah  daacription  of  any  concept  can  be  obtained  by  lookini  in  tha  alphabetical  aummary  of  concapta,  in 
Chaptar  5,  bafinninf  on  pafa  107. 


-vt:v:rs^  ■; 


K'’  ^  'T'  o »;-,  v  "*.'. 
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Appendix  2.1.14  Oset-insert 

Name(s):  Oset-insert,  Oset  insertion,  sometimes:  Insert; 

Definitions: 

Oelcarative  Slow:  X  (x,A,B)  [(Va<A)(a<B),  and  (Vb/x  <B)(b(A),  and  xeCAR(B}] 
Recursive  Slow:  X  (x,A,B)  (Ae[]  and  Be[x],  or  else: 

[AND:  z«-Member.AI{(A);  Member.Oefn(z,B); 

Oset-insert.Defn(x,Oset‘dsleis.Alg(z,A),Oset-delete.Alg(z,B)) }) 

Non-recursive,  Quick:  X  (x,A,B)  (BcCONS(x,Oset-delete.Algs(x,A}). 

Quick:  X  (x,A,B)  (B«Oset-inseri.Algs(x,A)). 

Necessary  Quick:  X  (x,A,B)  (x«CAR(0}}. 

Necessary,  Declarative:  X  (x,A,B)  (Vz)  z(B  iff  z<A  9  z«x. 

Domain/range:  <Anything,  Oseti  -*  Osets> 

Algorithms: 

Non-recursive  quick:  X  (x,A)  (C0NS(x,[if  Member.Defn(x,A)  then  DREMOVE(x,A)'^ 
else  A])) 

Non-recursive  quick:  X  (x,A)  (C0NS(x,A}  and  DREMOVE(x,CDR(A)}} 

Non-recursive  quick:  X  (x,A)  (CONS(x,DREMOVE(x,A)) 

Recursive:  X  (x,A)  (if  A«[]  then  [x],  else  if  A«[x...]  then  A,  else 
CONS  (x,Oset-delete.Algs(x,A))). 

Generalizations:  Insert 
Worth:  100 

What:  Eliminate  x  from  A  and  add  x  as  the  first  element  of  the  oset  A. 


15 


Th«  INTERLISP  function  DREMOVE(x,A)  d«*tructiv*ly  rtmovt*  *11  occurronco^  of  x  froin  th«  li«t  tlructura  A. 
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Appendix  2.1.15  List-insert 


Nam*(«):  Liit-inscrt,  List  inttriion,  tomttinws:  Instrt,  tomttimtt:  CONS; 
O«finitionst 

Nonr«eur(iv«  QuicK:  X  (x,A,B}  (B«C0NS(x,A)). 

Nonr«curtiv«!  X  (x,A,B)  (A>CDR(B)  ind  x>CAR(B)}.’* 

QuicK:  X  (x,A,B}  {B«List-ins«rt.AI|s(x,A)). 

Nactisir/  Quick:  X  (x,A,B)  (x«CAR(B)). 

Nactttiry  QuicK:  X  (x,A,B)  (A«CDR(B)). 

Domain/ranga:  <Anyihin|,  Liata  Liata> 

Alforithma: 

Non-racuraiva  quick:  X  (x,A)  C0NS(x,A). 

Racuraiva  alow:  X  (x,A)  (if  A*<>  than  <x>,  alaa 
NCONCl‘^Liat>incart.AI|a(x,AII-but-lact.Alga(A)),CAR(A)). 
Ganaralizationa:  Insari 
Worth:  100 

What:  Add  tha  alamant  x  onto  tha  front  of  iha  Llat  A. 


Appendix  2.1.16  Bag-jnsett 


Nama(s):  Bag'insart,  Bag  insartion,  aomatimas:  Inaart; 

Dafinitions: 

Nonracuraiva  QuicK:  X  (x,A,B)  (B«S0RT(C0NS(x,A))). 

QuicK:  X  (x,A,B)  (BaBag-inaart.AigsfXiA}). 

Domain/ranga:  <Anything,  Baga  -*  Bags> 

Algorithms: 

Non-racuraiva  quick:  X  (x,A)  MERGE(x,A). 

Non-racuraiva:  X  (x,A)  SORT(CONS(x,A)). 

Racuraiva  alow:  X  (x,A)  (if  A*()  than  (x),  alaa 
if  CAR(A)<'‘x  than  CONS(CAR(A),Bag-inaarUlga(x,COR(A))},  alsa  CONS(x,A)). 
Ganaralizationa:  Insert 
Worth:  100 

What:  Marga  tha  alamant  x  into  tha  Bag  A. 


K«iV(  how  thi*  would  foilly  «pp«*r  in  LISP:  (LAMBDA  (x  A  8)  (AND  [APPLV6  06J-EQUAL  ALGS  A  (APPLVB  ALL-BUT- 
FIRST.ELE  ALGS  B)}  (APPLVB  OBJ-EQUAL  ALGS  x  (APPLVB  FIRST-ELE  ALGS  B)]». 

Thi*  LISP  function  ntoxn*  'X(S,x)  •dd-lh«  olomont  (  to  th«  ind  of  litt  S'.  CDR  mooni  All-but-tho-firtt-olomont,  CAR 
moon*  Tho-firot-olomont. 

*  ^  Htro,  'loo*  Ihon'  moon*  'procodot  ilphtnumorictlly',  uting  ALPHORDER. 
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ADoendix  2.1.19  Bag-Delete 


Nim«(s):  Bag-DtItU,  Bag  Delation,  somatimas:  Delata; 

Dafinitions: 

Recuraiva  Slow:  \  (x,A,B)  (Ab()  and  Bb(),  or  also  (Ab(x...)  and  B>CDR(A}, 
or  alsa  Bag-dalata.Dafn(x,COR(A),COR(B). 

Quick:  B«Bag*Dalala.Algi(x,A). 

Dotnain/ranga:  <Anything,  Bags  -*  Bags) 

Algorithms: 

Non*raeursiva  quick  opaqua^^:  X  (x,A)  [2«-(MEMBER(x,A)$ 
RPLACA(z,CADR(z))s  RPLACD(z,CDDR(Z))] 

Racursiva:  X  (x,A)  (if  A*()  than  0,  alsa  If  CAR(A)bx  than  CDR(A),  alsa 
CONS(CAR(A},Bag-Dalata.Alg(x,COR(A}})}.  , 

Ganaralizations:  Dalata 
Worth:  100 

What:  ramova  ona  copy  of  x  from  tha  Bag  A,  if  x  was  int  thara  initially. 


Nama(s):  List-Oalata,  List  Dalation,  somatimas:  Dalata; 

Dafinitions: 

Racursiva  Slow:  X  (x,A,B)  (Ab<>  and  B><>,  or  alsa  CAR(A)ax  and  CDR(A}>B, 
or  alsa  List-dalata.Dafn(x,COR(A),COR(B}. 

Quick:  8«List*Dalata.Algs(x,A). 

Domain/ranga:  <Anything,  Lists  -*  Lists) 

Algorithms: 

Non-racursiva  quick  opsqua:  X  (x,A)  FRPLACD(z*‘(MEMBER(x,A),CODR(Z)) 
Racursiva:  X  (x,A)  (if  AbO  than  <),  alsa  if  CAR(A)bx  than  CDR(A),  alsa 
CONS(CAR(A),List-Dolata.Alg(x,CDR(A)))). 

Ganaralizations:  Dalata 
Worth:  100 

What:  rannova  tha  first  copy  of  x  from  tha  List  A,  if  x  is  in  A. 


Thtt  tlforithm  it  Itbtlltd  Optqut  btctuit  it  conliint  vtry  tifhl  'tnttky'  codt,  impltmtntinf  t  highly  non*«ttndird  linked 
dtt*  ttructur*  dtitlion  algorithm.  Th«  call  on  tht  Intarliap  function  MEMBER  bind*  t  to  th*  tail  of  A, 
beginning  with  th*  firet  occurrence  of  x. 


>•  VI V' 


'  VT  V'.  V'.  - 
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Namfl(s):  Osst-Dcisi*,  Osat  Oalation,  tomaiimat:  Dalata; 

Oafinitions: 

Recursiva  Slow:  X  (x,A,B)  (A^n  B>[},  or  alsa  CAR(A)sx  and  C0R(A)sB, 
or  alsa  08ai-dalaia.Da(n(x,CDR(A),CDR(B). 

Recursiva  Slow:  X  (x,A,B)  (A=[]  and  B=[],  or  alsa  Ab[x]  and  Bb[],  or  alsa: 
[AND:  z^Mambar.Alg(A)  until  z/x;  Membar.Dafn(z,B); 

Sat>Dalata.Dafn(x,Sat*deleta.Als(z,A),Sat'delate.Alt(z,B))  ]) 
Necassary  Quick:  X(x,A,B)  (CAR(A)*CAR(B)  xor  CAR(A)ex). 

Quick:  Be0sat*Dalata.Alc8(x,A}. 

Oomain/ranga:  [Anything,  Osats  -»  Osats] 

Algorithms: 

Non-racursiva  quick  opaqua:  X  (x,A}  DREMOVE(x,A). 

Non-racursiva  quick  opaqua:  X  (x,A}  FRPLACD(z«-(MEMBER(x,A},CDDR(z}} 
Racursiva:  X  (x,A)  (if  As[]  than  [],  alsa  if  CAR(A)sx  than  CDR(A),  alia 
CONS(CAR(A),Osat>Oalata.Alg(x,CDR(A)))). 

Non-racursiva  quick:  X  (x,A}  (if  NOT(Mombar.Dafn(x,A))  than  A, 

alsa  DREMOVE(x,A))) 

Non-racursiva  quick:  X  (x,A)  DREMOVE(x,A) 

Racursiva:  X  (x,A)  (if  As[}  than  [],  alsa  if  Ae[x}  than  [],  alsa 

[z^-CARfA);  if  z*x  than  CDR(A),  alsa  < 

if  z>x  than  A,  alsa  Os«t-Oalata.Alg(x,CDR(A)))]). 
Ganaralizations:  Dalata 
Worth:  100 

What:  ramova  tha  alamant  x  from  the  Osat  A,  if  it's  prasant  thara  initially. 


Apptndix  Z 


AM  Diacovtry  in  Mathtmttiet  i*  Hturittic  $«irch 


S  micM  b«  Of  S  e«n  bt  otc. 


Appendix  2 


AM  Diicovery  in  Mathematic*  at  Heurittic  Search 

Appendix  2.1.24  Oset-Intersect 


-187- 


Nam«(t):  Otei-lnttrsaci,  OsttHnUrsteiion,  tom«tim«e:  InUrctct. 

Definitions: 

Rflcursive^^:  X  (A,B,C)  if  A<]  than  C*[],  else 
if  CARfAX^^B  than  [CAR(A)«CAR(C)  and 
Osat>intarsact.Dafn(CDR(A),Osat>dalota.Alt(CAR(A},B),CDR(C))],  alsa 
Osat*intarsect.Dafn(CDR(A},B,C). 

Quick:  C*Osat-intersact.Algs(A,B). 

Onca  Early  Quick  Opaqua:  X  (A,B,C)  if  B  is  shorter  than  A, 
than  Osat*intarsact.Dafn(B,A,C). 

Domain/ranga:  <Osats  Osats  -*  OsatsX 
Algorithms: 

Onca  Early  Quick  Opaqua:  X  (A,B)  if  B  is  shorter  than  A, 
than  Osat‘intarsaet.Alg(B,A). 

Non-racursiva:  X  (A,B)  [for  each  x  in  A  (in  order),  do  the  following: 
if  x-'<B  than  DREMOVE(x,A).  Finally,  return  the  value  of  A. 

Non-racursiva:  X  (A,B)  [for  each  x  in  A  (in  order),  do  the  following: 
if  x<B  then  Osat-dalata.Alg(x,B),  alsa  Osat-dalata.Alg(x,A). 

Finally,  return  the  value  of  'A*  as  the  result. 

Recursive:  X  (A,B)  if  A«[]  than  [],  alsa  if  CAR(A)<B 

than  CONS(CAR(A),Osat-intarsaet.Alg(COR(A),Osat-dalata.Alg(CAR(A),B))), 
alsa  Osat-intarsaet.Alg(COR(A),B). 

Generalizations:  Intersect 
Worth:  100 

What:  Move  along  Osat  A,  eliminating  elements  not  found  in  Osat  A. 


22 

The  difference  between  thi*  definition  end  the  limilir  one  for  Li«t-int*r«*et  i*  that  here  we  cen  use  the  very  feet 
DREMOVE  elforithm  etored  in  Oeet-DeleteAlc,  where**  for  li«t*  it  w*»  neeeetery  to  ute  a  slow  Liet-delete 
elforithm. 

23 

To  **v*  «p*c*,  we  may  henceforth  write  'x(B'  to  mean  'M*mb*r.Defn(x,B)'. 
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Nam«(s):  S«t*lntersect,  Sat'lntersaetion,  comaiimas:  Intarsact. 

Dafinitiont: 

Onca  Early  Quick  Opaqua:  X  (A,B,C)  if  B  is  ahortar  than  A, 
than  Sat~intarsact.Dafn(8,A,C). 

Racursiva:  X  (A,B,C)  if  A«{}  than  Cn|},  alaa 
z^Soma-tnamb.AI{(A)$ 

If  Mambar.Dafn(z,B} 

than  [Mambar.Dafn(z,C)  and  SaHntartact.Oatn(Sat*dalata.AI|(z,A), 
Sat-dalata.Alg(z,B), 

$at*dalata.AlK(z.C})] 

alsa  Sat-intartact.Dafn(Sat*dalata.Alt(z,A),B,C}. 

Nonraeursiva  Daelarativa:  For  all  x,  xfC  Iff  x(A  and  x(B. 

Quick:  C«Sat-lntaraact.Algi(A,B). 

Oomain/ranga:  <$ats  Sati  -*  Satt> 

Algorithms: 

Onca  Early  Quick  Opaqua:  X  (AtB)  if  B  ic  ahortar  than  A, 
than  Sat*intarsact.Alg(B,A). 

Non*racursiva:  X  (A,B)  [for  aach  x  in  A,  do  tha  following: 

if  X'<<B  than  DREMOYE(x,A).  Finally,  raturn  tha  valua  of  A. 

Racursiva:  X  (A,B}  if  A«{}  than  {},  alsa  if  CAR(A)(B 

than  CON$(CAR(A),SaHntaraact.Alg(CDR(A),$at-dalata.Alg(CAR(A),B})}, 
flan  Sat*intaraact.Alg(CDR{A),B). 

Ganaralizationa:  Intarsact 
Worth:  100 

Whit:  Elimlnata  any  aiamants  of  Sat  A  which  ara  absant  from  Sat  B. 


Nam«(s):  Bag'lntcrsaet,  Bag“lnt«rs«ction,  som«titn«s:  Intersect. 

Definitions: 

Once  Early  Quick  Opaque:  X  (A,B,C)  if  B  is  shorter  than  A, 
then  Bag*intersect.Defn(6,A,C). 

Recursive:  X  (A,B,C)  if  As()  then  C«(),  else 

z«-CAR(A);  If  Member.Defn(z,B)  then  [Member.Defn(z,C)  and 
Bag'intersect.Defn(CDR(A),Bag-delete.Alg(z,B),B8g*delete.Alg(z,C))] 
else  Bag*intersect.Defn(CDR(A),B,C). 

Quick:  C«Bag-lntorsect.Alg3(A,B). 

Domain/range:  <Bags  Bags  -*  Bags> 

Algorithms: 

Once  Early  Quick  Opaque:  X  (A,B}  if  B  is  shorter  than  A, 
then  Bag-intersect.Alg(B,A}. 

Non-recursive:  X  (A,B)  [for  each  x  in  A,  do  the  following: 

if  x(B  then  B*-Bag.delete.Alg(x,B},  else  A^Bag-delete.Alg(x,A). 

Finally,  return  the  value  of  A. 

Generalizations:  Intersect 
Worth:  100 

What:  the  intersection  of  bags  A  and  B  should  contain  all  common  elements, 

with  each  element  occurring  the  ntinimum  number  of  times  it  occurs  in  A  or  B. 


Appendix  2.1.27  Union 

Name(s):  Union,  Join,  Unite,  sometimes:  Combine,  Append,  Sum. 

Definitions: 

Quasi-recursive  cases:  X  (A,B,C)  [determine  the  type  of  structure  A  and 
B  are,  say  $,  then  use  S-Union.Defn(A,B,C)]. 

Necessary,  Nonrecursive:  X  (A,B,C)  For  all  x,  x<C  iff  x(A  or  x(B 
Quick:  CsUnion.Algs(A,B}. 

Domain/range:  <Structures  Structures  -♦  Structures> 

Algorithms: 

Quasi-recursive  cases:  X  (A,B)  [determine  the  type  of  structure  A  and  B  are, 
cay  $,^^then  use  S-Union.Algs(A,B)]. 

Quasi-recursive  cases:  X  (A,B)  [determine  the  type  of  structure  A  and  B  are, 
say  S,  then  do  S*insert.Alg(CAR(A),Union(CDR(A),B))]. 
lea's:  Operation 

Specializations:  Set-Union,  Bag-Union,  List-Union,  Oset-Union. 

Worth:  100 
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Nam«(c}t  Litt'U.nion,  Append,  Ncone,  List-join,  somtiimei:  Union. 

Definitions: 

Recursive  s  3w:  X.  (A,B,C)  if  A><>  then  C^B,  else 

CAR(A}>CAR(C)  and  Liet-union.Defn{CDR(A),B,CDR(C)}. 

QuicK:  CaList'Union.Alf8(A,B). 

Domain/range:  <Lisi8  Lists  -*  ListsX 
Algorithms: 

Nonreeursive,  Quick,  Non-destructive,  Opaque:  X  (A,B)  (APPEND  A  B). 
Nonreeursive,  Quick,  Destructive,  Opaque:  X  (A,B)  (NCONC  A  B). 
Recursive:  X  (A,B)  if  A><>  then  B,  else 

CONS(CAR(A),List-UnionAlg(CDR(A),B)}. 

Generalizations:  Union 
Worth:  100 

What:  Append  list  B  to  the  end  of  list  A. 


Appendix  2.1.29  Qset-Union 


Name(s):  Oset-Union,  Oset-join,  sometimes:  Union,  Append. 

Definitions: 

Recursive  slow:  X  (A,B,C)  if  A«[]  then  C«B, 
else  CAR(A)«CAR(C)  and 

Oset-union.Defn[CDR(A), 

08et-deleteAlg(CAR(A),B}, 

08et-dalete.Alg(CAR(A},C)]. 

Quick:  C*0set-Union.Algs(A,3). 

Domain/range:  <Osets  Osets  -»  Osets) 

Algorithms: 

Nonreeursive,  Quick,  Non-destructive,  Opaque:  X  {A,B)  (APPEND  A  B). 
Nonreeursive,  Quick,  Destructive,  Opaque:  X  (A,B)  (NCONC  A  B). 
Recursive:  X  (A,B}  if  A«[]  then  B,  else 

CONS(CAR(A},Oset-Union.Atg(CDR(A),Oset-delete.Alg(CAR(A),B))). 
Generalizations:  Union 
Worth:  100 

What:  Append  onto  Oset  A  any  new  members  of  Oset  B. 
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Appendix  2.1.30  Set-Union 


Name(s}:  Set-Union,  Set-join,  sometimes:  Union,  Append. 

Definitions: 

Nonrecursive  Declarative:  X  (A,B,C)  V*,  x<C  iff  x(A  or  x<B. 

Recursive  slow:  X  (A,B,C)  if  Ae{}  then  C>>B,  else  CAR(A}<C  and 
Set-union.Defn(CDR(A}, 

Set-delete.Als(CAR(A),B),Skt*delete.Alt(CAR(A),C}). 

Quick:  C«Set-Union.Algs(A,B). 

Domain/range:  <Sets  Sets  -*  Sats> 

Algorithms: 

Nonrecursive,  Quick,  Destructive,  Opaque:  X  (A,B)  (UNION  A  B). 

Nonrecursive,  Quick,  Non-destructive,  Opaque:  X  (A,B) 

(Self-intersect  (APPEND  A  B}}. 

Recursive:  X  (.A,B)  if  Aa{}  then  B,  else 

Set-insert.Alg(CAR(A),Set-Union.Alg(CDR(A},Set-delete.Alg(CAR(A},B))}. 
Recursive:  X  (A./i)  if  A>{}  then  B,  else 

MERGE(C/\R(A),Set-Union.Alg(CDR(A),DREMOVE(CAR(A),B))). 
Generalizations:  Union 
Worth:  100 

What:  Merge  into  Set  A  any  new  members  of  Set  B. 


Appendix  2.1.31  Bag->Union 

Name(s):  Bag-Union,  Bag-join,  sometimes:  Union,  Append. 

Definitions: 

Recursive  slow:  X  (A,B,C)  if  Ae()  then  C>B,  else  CAR(A)(C  and 
Bag-union.Defnt 

Bag-delete.Alg(CAR(A),A),2^ 

Bag-delete.Alg(CAR  (A  },B ), 

Bag-delete.Alg(CAR(A),C)). 

Quick:  CeBag-Union.Aigs(A,B}. 

Domain/range:  <Bags  Bags  -*  Bags> 

Algorithms: 

Recursive:  X  (A,B)  if  A«()  then  B,  else 

Lag-insert.Alg(CAR(A},Bag-Union.Alg(CDR(A),Bag-del»te.Alg(CAR(A),B}}). 
Generalizations:  Union 
Worth:  100 

What:  Bag-union(A,B)  contains  any  x  belonging  to  either  bag,  with  multiplicity  of  x 
equal  to  the  maximum  of  the  multiplicity  of  the  element  x  in  A  end  in  B. 


“  Vet,  :hi*  ie  really  the  earn*  a*  COR(A),  and  in  the  other  concept*  in  thi*  appendix  the  shorter  form  i*  the  one  used. 
Here,  we  decided  to  show  the  nice,  tymme'rie  form  that  AM  actually  contain*. 
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Apoendix  2.1.32  Difference 

Nam«(s):  Diff«r«nce,  Structui'a-diffaranci.,  ${)m«tim«ss  Minus,  Subtract,  Compl«m«nt. 
Dafinitions: 

Quasi-raeursiva  casas:  X  (A,B,C)  [datarmina  tha  typa  of  structura  A  and 
B  ara,  say  S,  than  usa  S’Diff.Dafn(A,B,C)]. 

Nacassary,  Nonracursiva:  X  (A,B,C)  For  all  x,  xfC  iff  x(A  and  ’<x<B 
Quick:  CBDiffaranca.Alcs(A,B). 

Domain/ranea:  <Structuras  Structuras  -*  Structuras> 

Altorithms: 

Quasi-racursiva  casas:  X  (A,B)  [datarmina  tha  typa  of  structura  A  and  B  ara, 
say  S,  than  usa  S*DiffAlgs(A,B}]. 

Quasi«racursiva  casas:  X  (A,B}  [datarmina  tha  typa  of  structura  A  and  B  ara, 
say  S,  than  do  S*dalata.Alg(CAR(B},Diffaranca(A,CDR(B)))]. 

Isa’s:  Oparation 

Spacializations:  Sat-Diff,  Bag'Diff,  List-Diff,  Osat'Diff. 

Worth:  100 


Appendix  2.1.33  List-Diff 


Nama(th  List'Diffaranca,  LisHiff. 

Dafinitionit: 

Racursivt  slow:  X  (A,B,C)  if  A*<>  than  C«<>,  alsa 

If  CAR(A)(B  than  List-Diff.Dafn(CDR(A),List»dalata.AI|(CAR(A),B),C), 
alsa  CAR(A)>CAR(C)  and  Llst-Oiff.Dafn.(COR(A),B,CDR(C)). 

Quick:  CaList*Diff.Algs(A,B). 

Domain/ranga:  <Lists  Lists  -»  Lists) 

Algorithms: 

Nonracursiva:  X  (A,B)  for  x  in  A  (in  ordar),  if  x  is  in  B, 
than  usa  List-dalata  to  ramova  sn  x  from  A  and  B. 

Racursiva:  X  (A,B)  if  A»<>  than  <>,  alsa 

If  CAR(A}<B  than  List-Diff.Afg(CDR(A),List>dalata.Alg(CAR(A),B)), 
alsaCONS(CAR(A},List-Oiff.Alg(CDR(A),B)). 

Ganaralizations:  Diffaranca 
Worth:  100 

What;  Mova  x  along  A.  If  x  is  also  in  B,  ramova  it  from  A  and  from  B. 
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Appendix  2.1.34  Uset-Dlff 


Nam«(s):  Os«t-Oiff«r«nc«,  Osat-diff.  ^ 

Dafinitionsi 

Rceursiva  slow:  X  (A,3,C)  if  As[]  then  Cs[},  else 

If  CAR(A)<B  than  Csat'^3iff.Dafn(CDR(A},0sat-dalata.Alt(CAR(A},B),C}, 
else  CAR(A)*CAR(C)  and  Osat>Diff.Dafn(CDR(A),B,CDR(C)). 

QuicK:  C>0sat-Diff.AI{8(A,B). 

Domain/ransa:  <0sats  Osets  Osets> 

Algorithms: 

Nonracursiva:  X  {A,B)  for  x  in  A,  if  x  is  in  B,  than  remove  x  from  A  and  B. 

Recursive:  X  (A,B)  if  A«[}  then  [],  else 

If  CAR(A)<B  then  Oset-Diff.Alg(CDR(A},Oset-daleta.Alg(CAR(A),B)), 
else  CONS(CAR(A),Oset-Diff.Alg(COR(A),B)). 

Recursive:  X  (A,B)  if  A«[]  than  [],  else 

If  CAR(A)<B  then  Osat>Diff.Alg(CDR(A),B), 
else  CONS(CAR(A),Ocat-Diff.Alg(COR(A),B)}. 

Generalizations:  Difference 
Worth:  100 

What:  Moving  along  A,  when  an  element  also  in  B  is  encountered, 

use  Oset’delete  to  remove  it  from  A  and  from  B. 


Nama(s):  Set'Difference,  Set'diff. 

Definitions: 

Recursive  slow:  X  (A,B,C)  if  A'{}  then  Cb{},  else 

If  CAR(A)«B  then  Set-Diff.Oefn(CDR(A),$et-delete.Alg(CAR(A),B),C), 
else  CAR(A)>CAR(C)  and  $et-Diff.Defn(CDR(A),B,CDR(C)). 

Quick:  C«Set-Diff.Algs(A,B). 

Declarative  Nonrecursive:  (A,8,C)  Vx,  x<C  iff  x(A  and  xxCB. 
Domain/range:  <Sets  Sets  -»  Sets) 

Algorithms: 

Nonrecursive:  X  (A,B}  for  x  in  A,  if  x  is  in  B,  theri  remove  x  from  A  and  B. 

Recursive:  X  (A,B)  if  A»{)  then  {},  else 

if  CARfAXB  then  Set-Diff.Alg(CDR(A},3et-delete.Alg(CAR(A),B)}, 
else  CONS(CAR(A),Set-Diff.Alg(CDR(A),B)). 

Recursive:  X  (A,B)  if  A>{}  then  {},  else 

If  CAR(A)<B  then  Sei-Diff.Alg{COR(A),B), 
else  CONS(CAR(A),Set-Oiff.Alg(CDR(A),B)). 

Generalizations:  Difference 
Worth:  100 

What:  Members  of  set  A  which  are  not  in  Set  B. 
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Appendix  2.1.36  Bag-Diff 

Nam«(t):  BacDiffarancc,  Bag-diff. 

Oafinitions: 

Racurtiva  stow:  X  (A,B,C)  if  A«0  than  C«{),  alta 

If  CAR(AHB  than  Bag*Diff.Dafn(CDR(A),Bag>dalat».AIg(CAR(A),B),C), 
alta  CAR(A}«CAR(C)  and  Bag-Diff.Dafn(CDR(A),B,CDR(C)). 

Quick:  CBBag*Diff.Algc{A.B). 

Domain/ranga:  <Bagt  Bags  -*  Bags> 

Algorithms: 

Nonracut-siva:  X  (A,B}  for  x  in  A,  if  x  it  in  B,  than  ramova  an  x  from  A  and  B. 

Racurtiva:  X  (A,B)  if  A«()  than  (),  alsa 

If  CARfAXB  than  Bag-DiffAtg(COR(A),Bag-dalata.Alg(CAR(A),B)), 
alsa  CONS(CAR(A},Bag-Oiff.Alg(CDR(A),B}}. 

Racurtiva:  X  (A,B)  If  B«()  than  A,  alta 

If  CAR(3)<A  than  Bat-diff.Alg(Bag-dalaia.Alg(CAIUB),A),CDR(B)), 
alta  Bag*diffAlg(A,COR(B)). 

Ganaralizationt:  Diffaranea 

Worth:  100 

What:  Mova  x  along  Bag  B,  ramoving  ona  copy  of  aach  x  from  Bag  A. 
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Nama(s}:  Coalcsea,  $«lf*apply,  Condansa,  Cotlapsa,  Argumant  coincidanea. 

Dafinitions: 

Daclaraiiva  slow:  X  (F,G)  Tha  domain  of  Q  has  been  collapsed,  compared  to  F’s, 
by  the  removal  of  one  domain  component  0,  and  an  algorithm  for  G 
is  just  a  call  on  F,  with  two  arguements  tha  same.  Tha  only  constraint 
on  this  situation  is  that  tha  domain  component  from  which 
duplicate  argument  is  drawn  is  itself  a  spacialiaation  of 

Necessary,  quick:  X  (F,G)  Tha  length  of  each  Domain/ranga  entry  for 
F  is  one  larger  than  tha  length  of  each  entry  on  G.Dom/ranga. 

Necessary,  quick:  X  (F,G)  Tha  range  of  both  F  and  G  are  equal. 

Sufficient,  slow:  X  (F,G)  Are*equivalent(G,Coslesce.Algs(F)}. 

Sufficient,  quick:  X  (F,G)  G«Coalesca.Algs(F). 

Domain/range:  <Active  -*  Active) 

<Operation  Operation) 

<Predicate  -*  Predicate) 

Algorithms: 

Distributed:  use  the  heuristics  attached  to  Coalesce  to  guide  tha  filling 
in  of  various  facets  of  tha  new  Coalesced  concept. 

Generalizations:  Operation 
Isa's:  Operation 
Worth:  300 
Fillin:  4  heuristics. 

Check:  1  heuristic. 

Suggest:  2  heuristics. 


Com#  •x«mpl««  of  Ihi*.  (t)  Ce«lt«cf.O«fn(TIMES,Squirf),  btcauti  TIMES  Domaln/rangt  contain*  <Numl'*r  Number  -• 
Numb«r>  and  Squar*.Oomain/ran(*  contain*  <Numb*r Numb«t>,  and  a  definition  of  Square  i>.  'Tir.te«(x,x)’, 
and  clearly  Number  i*  a  apecialization  of  Number  (a  vacuou*  apacialization).  So  Square  i*  <;  cotiesciid  fnrin 
of  TIMES,  (ii)  Coale*ce.Defn(ln**rt,Self-in*ert),  where  the  Uttar  concept  i«  defined  a*  ln**rt(S,E).  Th,> 
domain  of  Ineert  i*  Anything  x  Structure:  the  domain  of  the  new  operation  i*  juat  Structure.  Thi*  pasaee 
Coaletce.Defn  becauae  Structure  i*  a  apecialization  of  Anything:  if  we  can  inaert  ANYTHING  into  a  structure, 
then  certainly  it  i*  permiaaabie  to  inaert  a  STRUCTURE  into  a  structure,  (iii) 
Coale*ee.Defn(Equality,Con*tant-T)  because  Equality  i*  raflexiva  (x>x  always). 
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Nam«(c):  Canoniz**  Canonicaliza,  Siandardizt,  eomatimas:  normalize. 
Dafinitions! 

Slow:  X  (P1,P2,F)  PI  and  P2  ara  pradicaies  over  AxA, 
and  F  is  ar  operation  from  A  to  A, 
and  (Vx,y<A)  Pl(x,y)  iff  P2(F{x),F{y)).2^ 

Sufficient,  alow:  Ara-aquivalant(F,Canoniza.AI{s(Pl,P2)). 

Sufficient,  quick:  F«Canonize.Alcs(Pl,P2}. 

Domain/rance:  <Predicate  Predicate  -*  Operation> 

Algorithms: 

Distributed:  use  the  heuristics  attached  to  Canonize  to  guide  the  filling 
in  of  various  facets  of  the  new  canonization. 

Generalizations:  Operation 
Isa’s:  Operation 
Worth:  200 
Fillin:  6  heuristics. 

Sueeest:  5  heuristics. 
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Appendix  2.1.39  Parallel-replace2 

- 1 

Nam«($):  Parall«l-r«plac«2,  Map>raplac*2,  Parallal-substituta. 

Dtfinitions: 

Quick:  GeParallel-Replaca2.A[gs(S  I  ,S2,F). 

Domain/ranja:  cType-of-siruetura  Type-of-aiructura  Operation  -♦  Oparation> 

Aleorithms: 

Nonracurtiva;  X  (S1,S2,F,G)  G  is  an  operation  whose  domain  is  SlxS2  and 
whose  range  is  Range(F).  For  any  structures  sl<Sl,  s2<S2, 

G(sl,s2)  is  compute  by  replacing  each  element  x  of  si  by  the 
value  of  F(x,s2).  Notice  this  means  that  F  must  be  an  operation 
with  a  domain/ranga  entry  of  the  form  <D  52  R>,  where  R  is 

unconstrained,  but  0  is  either  'Anything'  or  ■■  if  $1  is 
of  the  form  'Strueture-of-EV  ••  E. 

Non-recursive  quick:  X  ($1,$2,F)  if  F(x,y}  doesn't  depend  on  y, 
then  just  do  Paratlei-replaee.Algs(Sl,F). 

Specializations:  Parallel-replace 
Isa's:  Operation 
Worth:  100 

What:  create  a  new  operation,  which  takes  2  structures  $1  and  52,  and  replaces  each 

member  x  of  SI  by  F(x,S2). 


Appendix  2.1.40  Parallel-replacc 

Name(s):  Parallel-replace,  Map-replace,  Parallel-substitute,  MAPCAR. 
Definitions: 

Quick:  X  (S1,F,G)  G=Parallel-Replace.Algs(Sl,F}. 

Domain/range:  <Typs-of-structure  Operation  0per3tion> 

Algorithms: 

Nonrecursive:  X  (S1,F,G)  G  is  an  operation  whose  domain  is  SI  and 
whose  range  is  Range(F).  For  any  structure  si  (SI, 

G(sl}  is  computed  by  replacing  each  element  x  of  si  by  the 
value  of  F(x).  Notice  this  means  that  F  must  be  an  operation 
with  a  domain/range  entry  of  the  form  <D  -♦  R>,  where  R  is 
unconstrained,  but  0  is  either  'Anything'  or  --if  SI  is 
of  the  form  'Strueture-of-E's'  —  E. 

Generalizations:  Parallel-replace2 

Worth:  100 

Sugg:  2  heuristics.^® 

What:  create  a  new  operation,  which  takes  a  structures  SI,  and  replaces  each 
member  x  of  SI  by  F(x). 


2S 

Th«it  Ktuilly  dtal  with  aubatitulion  oparatioAt,  th«  RESULTS  of  applying  Parallal-raplaca  and  Parallfi-rapiac«2. 
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Appendix  2.1.41  Repeat2 


Nam«(«):  R«p«at2,  Map>r«paat2,  itarat«2,  Map2,  MAP2C0NC. 

Definitions: 

Quick:  X  (Si,S2,F,G)  G-Repeat2.Alss(Sl,S2,F). 

Domain/ran{e:  <Type*of>structure  Type’Of*itructure  Operation  -*  Operation) 
Algorithms: 

Nonrecursive:  X  (S1,S2,F  GsRepeat2(Sl,S2,F}  is  an  operation  whose 
domain  is  SlxS2  and  whose  range  is  Range(F). 

For  any  structures  si  (SI,  s2(S2, 

G(sl,s2)  is  computed  by  the  following  algorithm: 
y«-CAR{8l)5sl*-CDR{8l); 
while  si  do:  y«-F(y,s2,CAR(8l));  sl«-C0R(8l); 

Finally,  return  y. 

Notice  this  means  that  F  must  be-  an  operation  whose  domain/range 
has  the  form  <sl  S2  si  •*  sl>. 

Non-recursive  quick:  X  (S1,S2,F)  if  F(x,y,z)  doesn't  depend  on  z, 
then  jujt  do  Repeat. Alg8(Sl,F). 

Specializations:  Repei  ‘ 

Isa’s:  Operation 
Worth:  100 

What:  create  a  new  operation,  which  takas  2  structures  $1  and  S2,  and  repeats 
F(x,y,s2)  along  the  members  x,y  of  Si. 


Appendix  2.1.42  Repeat 


Name(s):  Repeat,  Map,  Iterate,  Sequence. 

Definitions: 

Quick:  X  (S1,F,G)  G>Repeat.Algs(Sl,F). 

Domain/range:  (Type-of-structure  Operation  *4  Operation) 

Algorithms: 

Nonrecursive:  X  (S1,F)  Repeat(Sl,F)«G  is  an  operation  whose  domain 
is  SI  and  whose  range  is  Range(F).  For  any  structure  sl<Sl, 

G(sl)  is  computed  by  the  following  algorithm: 
y«-CAR(sl)$8l<-C0R(sl)) 
while  si  do:  y«>F(y,CAR(8l));  il*-C0R(sl)i 
Finally,  return  y. 

Notice  this  means  that  F  must  be  an  operation  whose  domain/rangc 
has  the  form  <sl  si  -f  si). 

Generalizations:  Repeat2 
Worth:  100 

What:  create  a  new  operation  which  repeats  F  all  the  way  along  an  $1. 
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Appendix  2.1.43  ParaIlel-ioin2 


Nam*(s}:  Parall«l'join2,  Map-join2,  Parallal*union2,  MAP2C0NC. 

Oafinitions: 

Quick:  X.  (S1,S2,F,G)  GsParall«l-join2.Algs(Sl,S2,F). 

Oomain/ranga:  <Typa-of-alructura  Typa-pf-atructu^’s  Oparaiion  -»  Oparation> 
Algorithms: 

Nonracursiva:  X  (S1,S2,F,G)  G  is  an  operation  whosa  domain  is  SlxS2  and 
whosa  ranga  is  Ranga(F).  For  any  structuras  sUSi,  s2<S2, 

G(sl,s2)  is  eomputa  by  appending  together  the  vaiues  of  F(x,s2}, 
for  each  element  x  in  si.  So  F  has  to  be  an  operation 
with  a  domain/ranga  entry  of  the  form  <D  $2  -*  K>,  where  R  is 
a  type  of  structure,  but  0  is  either  'Anything'  or  if  $1  it 
of  the  form  'Structure-of-E's'  —  E. 

Non-racursiva  quick:  X  (S1,S2,F)  if  F(x,y)  doesn't  depend  on  y, 
than  just  do  Parallol-join.Algs(Sl,F). 

Specializations:  Parallahjoin 
Isa's:  Operation 
Worth:  100 

What:  create  a  new  operation,  which  takes  2  structuras  SI  and  S2,  and  joins 

together  F(x,s2)  for  each  member  x  of  SI. 


Appendix  2.1.44  Parallel-ioin 


Name(s):  Parallel-join,  Map-join,  Parallel-union,  MAPAPPEND,  MAPCONC. 
Definitions: 

Quick:  X  (Si,F,G)  G>Parallel*join.Algs(Sl,F). 

Domain/range:  <Type-of-structure  Operation  -*  Operation> 

Algorithms: 

Nonrecursive:  X  (S1,F,G)  G  is  an  operation  whose  domain  is  SI  and 
whose  range  is  Range(F).  For  any  structure  sHSl, 

G(sl)  is  computed  by  appending  together  the  values  of  F(x), 
for  each  x<sl.  Notice  this  means  that  F  must  be  an  operation 
with  a  domain/range  entry  of  the  form  <D  -*  R>,  where  R  is 
a  type  of  structure,  and  D  is  either  'Anything'  or  —  if  SI  is 
of  the  form  'Structure-of-E’s'  —  E. 

Generalizations:  Parallel-join2 
Worth:  100 

What:  create  a  new  operation,  which  takes  a  structure  SI,  and  joins  together 

F  of  each  member  of  SI. 
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Apoendix  2.1.45  Reverse-ord-pair 

Namt(s):  Ravartt'Ord-pair,  Ravarca  ordarad  pair,  Switch  CAR  and  CADR. 

Dafinitions: 

Nonraeursive  quicK:  X  (P,Q)  First.Alg(P)EFinal.AI{(Q), 
and  Final.AI{(P}>Firat.Alg(Q). 

Quick:  X  (P,Q)  QeRavaria’Ord‘pair.Algc(P}. 

Domain/ranga:  <Ordarad‘'pair  -*  Ordarad*pair> 

Algorithms: 

Nonracursiva:  X  (P)  Q«-P;  Fir8t.Alg(Q,Final.Alg(P))*®;  Final.Alg(Q,First.Alg(P));  Q. 
Nonracursiva  quick  opaqua,  nondastructiva:  X  (P)  LIST(CADR(P),CAR(P). 

Nonracursiva  quick  opaqua,  destructiva:  X  (P)  z^Last~ala(P}; 

FRPLACA(CDR(P),CAR(P)){  FRPLACA(P,z)}  P. 

Nonracursiva  quick  opaqua,  nondastructiva:  X  (P)  REVERSE(P}. 

Nonracursiva  quick  opaqua,  dastructiva:  X  (P)  DREVERSE(P). 

Isa’s:  Operation 
Worth:  100 

What:  turn  the  ordarad  pair  <x,y>  into  the  ordarad  pair  <y,x>. 


Appendix  2.1.46  Last-element 


Nama(8}:  Last-alamant,  Final  mambar. 

Definitions: 

Racuriiva:  X  (S,x)  2«*First'‘alamant.Alg($),  and  $4-Dalata.Alg(z,S), 

and  if  Empty-itruc.DafnfS)  than  xn,  else  Last<’alamant.Dafn($,x}. 
Quick:  X  (S,x)  xeLast-alamant.AIgsfS). 

Domain/ranga:  <Ordarad*structura  ->  Anything> 

Algorithms: 

Recursive:  X  (S)  z**First*alamant.Alg(S),  and  S^Dalata.Aig(z,$), 
and  if  Empty*siruc.Dafn(S)  than  z,  else  Last'alamant.Alg(S). 
Nonracursiva  quick  opaqua:  X  (S)  CAR(LA$T(S)). 

Isa’s:  Operation 
Worth:  100 

What:  find  the  final  mambar  of  the  ordarad  structure 


29 

The  exprettion  Firtt  Al|(A,x)  will  rt«ult  in  «  RPLACA:  the  firtt  •lement  of  A  will  bo  romovod,  tnd  in  it*  ploee  x  will 
tppoir.  Thus  First  Alg(<s  b  c  d>,  x)  will  rsturn  ss  its  vsluo  tho  now  list  <z  b  c  d>. 

30 

Actutlly,  this  concept  is  much  more  sophisticstod.  If  Litt-olomont.AI|s  is  esilod  with  TWO  oriumonts,  S  end  v,  then  the 
intention  ie  taken  to  be  to  REPLACE  the  lest  element  of  S  by  the  element  v.  Thus  that  lest  element  ie 
deleted,  end  v  is  added  at  the  end  of  S.  This  is  done  by:  FRPlACA(LAST(S),v/.  To  review:  Last- 
elementAlg(A,x)  rasats  the  final  member  of  A  to  x,  while  Last>alement  Defn(A,x)  merely  tests  whether  the 
last  member  of  A  is  x. 


 ,J.-  -J,  . 
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Appendix  2.1.47  First-element 


Nam«(t)!  ririi'fliamant,  Initial  mambar,  Haad,  Front  alamant,  CAR. 

Dafinitionst 

Racurtivat  X  (S,x)  s«-Lait*alamant.Alc($),  and  S*-Dalata.Alt(2,S), 

and  if  Empty-itrue.DafntS)  than  xaz,  alta  Firtt-alamant.Dafn(S,x). 
QuicK:  X  (S,x)  xaFirit*alamant.Alt<(S). 

Domain/raniat  <Ordarad-itruetura  *4  Anythinf> 

Algorithma: 

Racuriiva:  X  (S)  2*’Last*alamant.Alg(S),  and  S*'Oalata.AI|(z,$), 
and  if  Emptyatruc.DafnfS)  than  z,  alta  Firtt*alamant.Alf (S). 
Nonracuriiva,  vary  quick,  opaquat  X  ($)  CAR($). 

Isa'c:  Oparation 
Worth:  100 

What:  find  tha  initial  mambar  of  tha  ordarad  atruetura  S.^' 


Appendix  2.1.48  All-but-the-first-element 


Nanfw(s):  Raar,  All  but  tha  first  alamant,  All*but*firtt,  COR,  Tail,  tomatimas:  back. 
Oafinitiont: 

Nonrecurtiva:  X  (S,R)  Litt''dalata.Dafn(CAR(S),S,R}. 

Nonracjriiva:  X  ($,R)  Ust*intart.Oafn(CAR(S),R,S). 

Nonracurtivai  X  ($,R)  CDR(S)*R. 

Quick:  X  (S,R)  R«RaarAlgs($). 

Domain/rangat  (Ordarad-itructura  Ordarad-itructura) 

Algorithms: 

Nonracurtiva,  vary  quick,  opaqua:  X  ($)  CDR(S) 

Nonracursiva:  X  ($)  z«-Firtt'aia.Alg(S};  Litt'dalataAlgs(z,S). 

Isa*t:  Oparation 
Worth:  100 

What:  ramova  tha  initial  nMmbar  of  tha  ordarad  structura  S. 
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Actutlly,  this  optrtlion't  tlforithm,  if  ftd  two  tr|umont*  S  tnd  v,  will  rtpitct  tht  firtt  oltmont  of  $  by  v,  voinf 
FRPlACA(S,v).  So  thi*  tinflo  concept  contiint  both  CAR  tnd  FRPLACA  knowlodft.  Thit  io  not  ohown 
explicitly  in  the  ontriot  for  Firtt-oWmontAlft. 


Apptndix  2  AM  Discovery  in  Mithamitie*  at  Hturi$tic  $«irch  *202' 

Appendix  2.1.49  All-biit-the-last-element 

N«mt(s):  All*but-<h«-last*«l«m«nt,  All-but-last,  iom«iim«s:  front. 

Dofiniiions: 

Quick:  \  (S,R)  R»AII-bul-last.Alt8(S). 

Domain/ranga:  <0rdar*d-8tructura  -» 0rd#rad-8tructur«> 

Algorithm8: 

Nonracuraivo,  vary  quick,  opaqua:  X  (S)  f  RPLACO(LAST($},NIL). 
iaa'c:  Oparation 
Worth:  100 

What:  ramova  tha  final  alamant  from  tha  ordarad  struetura  S. 


Appendix  2.1.50  Member 


Namafa):  Soma-alamant,  Random  mambar,  Any  alamant  of,  Mambar,  In,  Soma-mambar. 
Dafinitions: 

Racuraiva:  X  (x,S}  Nonampty-8truc.Dafn($)  and 
if  First’’ala.Dafn(S,x)  than  Trua, 

alsa  Mambar.Dafn(x,AII*‘but*firat*‘ala.Alg(S)}, 

Nonracuraiva  quick  opaqua:  X  (x,S)  MEMBER(x,S)}. 

Suffieiant,  vary  quick,  opaqua:  X  (x,$)  FMEMB(x,$}). 

Quick:  X  (S,x)  x>Mambar.AI|i(S). 

Domain/ranga:  <Structura  *4  Anythin|> 

Algorithma: 

Nonracuraiva  opaqua:  X  ($}  CAR(RAND-PERMUTE(S)}. 

Nonracuraiva  quick  opaqua:  X  (S)  CAR(S)). 

Racuraiva  alow:  if  S  la  ampty  than  fail,  otharwiaa  if  $*(x)  than  x, 
alaa  if  RAND(0,l)«i  than  Firat*ala.AI|(s}, 
alaa  Mambar.Alg(AII>but>laai.Alg(S)). 

laa’a:  Oparation 
Worth:  100 

What:  find  a  random  mambar  of  tha  atructura  $. 


-J-  -j> u* r.s ’ K* u-sr«r;-5ru5-'.N\.V'’Clv:’^riV'^^Vi -v'  ■<.’  «.  ■’<.- 
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Appendix  2.1.S1  Projectionl 

Namf(t):  Projtciionl)  First'irgumeni,  Projl. 

Dsfinitions: 

Nonrscuriiv*  quick:  X  (x,y,...,z)  z«x 

Quick:  X  (x,y,...q,z)  z«Som«’«l*m«nt.Algs(x,y,...q). 

Domain/rang*:  <*-D  Anything...Anything  -» 'D’> 

Algorithms: 

Nonroeuriiv*  quick:  X  (x,y,...,q)  x. 

Isa’s:  Operation 
Specializations:  Identity. 

Worth:  100 

Vi/hat:  accept  a  bunch  of  arguments  and  return  the  first  one. 


Appendix  2.1.5?  Projection2 

Name(s):  Projection2,  Second’srgument,  Proj2. 

Definitions: 

Nonrecursive  quick:  X  (x,y,...,z)  z^y 
Quick:  X  (x,y,...q(Z)  z«Some*element.Algs(x,y,...q). 
Domain/range:  <Anything  *-D  Anything...Anylhing  -♦  'D’> 
Algorithms: 

Nonrecursive  quick:  X  (x,y,...,q)  y. 

Isa’s:  Operation 
Specializations:  Identity. 

Worth:  200 

What:  accept  a  bunch  of  arguments  and  return  the  second  one. 
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Thie  meene  thet  'D*  cm  be  enythinf,  to  lonf  «•  H'e  the  etme  in  both  piece*  in  Ihe  dom*in/ran|e  templete.  Thu*  thi* 
incK/dee  <S*t*  Anylhini  Anythinf  e  S*t*>. 


Apptndix  2 
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Appendix  2.1.53  Identity 


Nam«(s):  Identity,  identity-operation,  no-op.  Self,  no  change. 
Definitions: 

Nonrecursive:  X  (x,y)  Equality.Defn{x,y) 

Nonrecursive  trensform:  X  (x,y)  Projl.Defn(x,x,y) 
Nonrecursive  transfoi'm;  X  (x,y)  Proj2.Defn{x,-5,y) 

Sufficient,  v^ry  quick,  opaque:  X  (x,y}  EQ(x,y}}. 

Quick:  X  (x,y)  y«ldentity.Algs(x). 

Domiin/range:  <Anything  Anything) 

<0bject  -» Object) 

<Structures  Structures) 

<Active  Active) 

Algorithms: 

Nonrecursive  quick:  X  (x)  x. 

Nonrecursive  transform:  X  (x)  Projection!. A!gs(x,x). 
Nonrecursive  transform:  X  (x)  Projection2.Algs(x,x). 

Conjee:  'Identity,  restricted  to  Objects,  is  the  same  as  Obj-Equality.* 
Generalizations:  Projection!,  Projection2. 

Worth:  !00 

What:  the  identity  operation,  closely  related  to  Equality. 


Appendix  2.1.54  Restrict 


Nsme(s):  Restrict,  Constrain  the  domain/range  of  an  active. 

Definitions: 

Nonrecursive:  X  (F,G)  The  domain/range  of  G  are  more  restrictive^ 
than  that  of  F,  and  G.Defn  is  just  a  call  on  F.0efn. 

Sufficient,  Quick:  X  (F,G)  GaRestrict.Algs(F). 

Domain/ranget  <Active  Active) 

<Operation  -*  Operation) 

<Predicate  Predicate) 

Algorithms: 

Distributed:  use  the  heuristics  attached  to  Restrict  to  guide  the  filling 
in  of  various  facets  of  the  new  Restricted  concept. 

Plus:  an  explicit  little  program  for  making  the  substitution 

in  the  Domsin/range  facet,  which  is  the  essence  of  this  concept. 

Isa's:  Operation 

Worth:  200 

Fillin:  3  heuristics. 
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That  M,  one  (or  mor«)  componont  of  th«  G.Oomain/rinfo  onlry  i*  i  proper  <poei«lii*tlon  of  <h«  corratpondinc  F.Dom/ran 
antry,  and  ail  tha  othar  componanta  match  up  aquilly. 
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)endix  2.1.55  Invert-an-operation 


Nam«(s):  Invart,  Find  ih«  invars*  of  in  optration. 

Dofinitions: 

Declirstiv*  slow:  \  {F,G)  Th*  domain  of  G  is  the  range  of  F,  plus  al!  ih* 
domain  components  of  F  except  one,  D;  the  rang*  of  G  is  then  D. 
The  value  of  G.D*fn(xi,.M,r,...,d)  must  be  the  same  as  the  value 
the  value  of  F.D*fn(xl,...,d . r),  for  any  xl,...,d,  and  r. 

Necessary,  quick:  X  (F,G)  The  length  of  each  Domain/rangc  entry  for 
F  is  the  same  as  the  length  of  each  entry  on  G.Oom/range. 

Necessary,  quick:  X  (F,G)  Taken  as  SETS,  a  domain/range  entry  from  F 
and  on*  from  G  are  actually  Equal. 

Sufficient  quick:  X  (F,G}  G  has  the  Nam*  'F'invers*'. 

Quick:  X  (F,G)  Gsinv*ri.Algs(F). 

Domain/rang*:  <Op*ration  -*  Operation> 

<Op*r*tion  •+  lnv*rt*d-op> 

Algorithms: 

Distributed:  us*  the  heuristics  attached  to  Invert  to  guide  the  filling 
in  of  various  facets  of  the  new  Inverted  poncept. 

Isa's:  Operation 

Worth:  300 

Fillin:  1  heuristic. 

Suggest:  1  heuristic. 


Name(s):  Inverted  op*rstion,lnv*rs*,  sometimes:  converse. 

Definitions: 

Declarative  slow:  X  (F)  For  some  known  operation  G,  lnv*rt.D*fn(G,F). 
Necessary,  quick:  X  (F)  The  rang*  of  F  is  one  single  known  concept. 
Sufficient  quick:  X  (F)  F  has  the  Nam*  'G*inv*rs*'  for  some  G. 
Generalizations:  Operation 
In-domain-of:  invert.®^ 
ln-rang*-of:  invert 
Worth:  200 


*  Thi*  just  m**n*  <h«t  «ueh  opsrition*  tr«  th«mt*lv*«  titily  inv«rt*bl*. 


App«ndix  2 
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Appendix  2.1.57  Relation 


Namt(t):  RtUtion,  rtlationthip. 

Definitions:  non*. 

Generaiizstions:  Active 
Specializations:  Logical'combination 
Worth:  100 

View:  To  view  an  operation  F  ar  a  relation,  consider  it  as  the  set  of  all  ordered 
pairs,  a  subset  of  Dom(F}xRan(F),  containin{  <x,y>  iff  F.Defn(x,y). 

NOTE:  This  concept  exists  in  only  rudimentary  form  in  AM  at  the  moment. 


Appendix  2.1.58  Logical-combination 


Namefs):  Losieal  Combination,  Boolean  relation. 

Definitions:  none. 

Generalizations:  Relation. 

Examples:  Conjoin,  Disjoin,  Imply,  Negate^^ 

Worth:  200 
Check:  1  heuristic 
Interest:  3  heuristics 
Sugg:  2  heurist’M 

NOTE:  This  concept  exists  in  only  rudimentary  form  in  AM  at  the  moment. 
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Th«««  (rtn't  codsd  ••ptrCtly  «t  conctptf  in  AM,  y«t. 
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Appeiidix  2.1.59  Object 

Name(s):  Objtct,  static  concept,  Passive 
Definitions:  none.^^ 

Specializations:  Structure,  Atom*obj,  Conjecture^^ 
Generalizations:  Any*concept 
Examples:  none. 

Isa's:  An/'concept 
In-domain-of:  Object-equality 
Worth:  100 
{No  heuristicsf^ 


Appendix  2.1.60  Conjecture 

Name(s):  Conjecture,  Conjee,  Hypothesis,  Guess,  Observation,  Thesis,  Belief. 
Definitions: 

Nonrecursive,  QuicK:  X  (x)  Match  x  with  <C0NJEC:  ...> 

Generalizations:  Object 
ln-domain*oft  Prove^®,  Disprove,  Tost 
In-ranju-cf:  none^®. 

Worth: 
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Rtcall  that  all  thit  mtant  it  that  computationally)  any  antily  x  la  conaidarad  to  be  an  Objact  iff  it  ia  an  axampla  of  aomt 
Specialization  of  thia  concap*  Thua  tha  iiat  (3  A  NIL)  ia  an  objact,  bacauaa  it  ia  a  Lift,  and  Liat  ia  ona 
Spacialization  of  Structure,  ard  Structure  ia  a  Specialization  of  Objact. 

37 

Thia  ahould  be  'Statement',  and  that  concept  ahould  have  Conjecture  aa  a  apecialization,  along  with  Theorem,  Falaahood, 
etc.  Thit  waa  never  fully  implemented  in  the  AM  code,  however 

38 

The  paucity  of  heuriatica  here  atteata  to  the  little  that  atructurea,  atatamenta,  and  atoma  have  in  common.  They  are 
merely  non-active«.  There  ia  much  that  doea  not  apply  to  any  of  tham  (aee  the  Active  and  Operation 
concepta),  but  very  few  rulet  of  thumb  applicable  to  all  3  of  them. 

39 

At  the  moment,  none  of  thete  three  concepta  ia  in  AM. 

AO 

Conjtctuptt  art  prodvetd  by  hiuriitk  not  machinkally  by  runnini  torn*  Activa  concapt 


M*ny  of  th«  noun*  in  thi*  box  tr*  not  impl*m*nl*d  *t  eonctpt*  in  AM;  •!.,  V*ri*bl«,  Utntifnr,  UNPACK,  MKATOM,... 

'  Tbn  *b«*nc*  of  iny  h*uri«<icf  htr*  ju*t  *mph**a*«  <h«  ftef  1h«t  littr*!  con»t«nt*,  idontifwr*,  v*ri*bl«*,  T,  ofc.  hav*  vary 
litti*  in  common  th*4  ALL  object*  don’t  «h*r*. 

'  Sine*  no  definition  i*  providad,  AM  n«v*r  {•noralizad  or  *p«ci«liz*d  thi*  concept,  looked  for  new  example*  of  it,  etc. 

^  Thue,  •*  in  Liep  iteelf,  *n  entity  i*  eeeociated  with  Felee  iff  it  i*  null,  and  with  True  iff  it  i*  anythini  el*e  in  the  world. 
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A  »?t>eiidix  2.1.63  Structure 


Nam«(s):  Structurs,  Data-si.  -.-^  i somstimas:^^  List-structure. 

Definitions: 

Necessary,  Non-Recursiva,  Quick,  Opaque:  \  (x)  LISTP(x) 

Specializations:  Ord-struc,  Unord-struc,  Empty-strjc,  Non-ampty-strue, 

Multiple-alements-struc,  No-multiple-olements-struc,  Struc-of-strucs. 
Generalizations:  Object 

In-domain-of:  Insert,  Delete,  Member,  Empty,  Nonempty,  Difference,  Union,  Intersect, 
Parallel-replace(2),  Parallel-join(2),  Repeat(2). 

In-range-of:  Insert,  Delate,  Difference,  Union,  Intersect. 

View:  To  view  any  entity  x  as  a  structure,  insert  x  into  an  empty  structure. 

Worth:  200 
Fillin:  2  heuristics, 
interest:  2  heuristics 


endix  2.1. 


Namets):  Structure-of-structures,  struc-of-strucs. 

Definitions: 

Recursive:  X  (S)  Empty-struc.Defn(S)  or 

[Structure.Defn($)  and  z^Member.AI((S)  and  $trueture.Defn(z)  and 
Structure-of-Structures.Defn(Delete.Ales(z,$))]. 

Declarative  PC:  X  ($)  $tructure.Defn(S)  and  (Vx($)  $tructure.Defn(x). 
Specializations:  none.^^ 

Generalizations:  Structure 
Worth:  300 


That  It,  tha  uatr  micht  trroneocaly  typa  'Liat-atructura  wnan  ha  raally  tnaana  any  kind  of  atructura. 

AM  apacializad  thia  by  raplaeinf  aaeh  of  tha  two  ealit  on  'Strvetura  Dafn'  inaida  Strue>of-ttruca.Oafn  by  a  call  on  tha 
daf inilion  of  a  ainfla  typa  of  atructura,  tharaby  creating,  a  g.,  Bag-of*Sata,  Liat-of-Oaata,  Bag-of>Primaa, 
ate.  Theaa  apaciairzad  concapla  wara  than  kapt  around  ao,  a.g.,  tha  aampla  tracaa  in  Chaptar  6  and  in 
Appendix  5  aomatimaa  refer  to  them.  Alao,  thia  concept  and  ita  apacializationa  can  be  diacoverad 
indapandantly  by  AM,  uaing  hauriatie  rule  number  232  (aaa  Appendix  3)  to  form  a  new  intaraating  typa  of 
atructura. 
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Appendix  2.1.65  Ord-Structure 


Namt(«):  Ord'ttrue,  Ordered  Structure,  lOmetimes:  List-itructure. 

Definitions:  none 
Specializations:  Osets,  Lists 
Generalizations:  Structure 

In'domain-of:  First~ele,  Last*ele,  All'but'first-ele,  All-but'last-ele. 
In-ranje-of:  All-but-first-ele,  All*but-last»ele. 

View:  To  view  any  unord-struc  as  an  ord-struc,  do  nothing  to  it,  or  permute  it. 
Worth:  200 
Fillin:  2  heuristics. 

Check:  2  heuristics. 

Interest:  1  heuristic 


Appendix  2.1.66  Unord-Stnicture 


Nsme(s):  Unord'struc,  Unordered  Structure,  eometimes:  Coliection 
Definitions:  none 
Specializations:  Sets,  Bags. 

Generalizations:  Structure 

View:  To  view  any  ordered-struc  as  an  unord-struc,  SORT  it. 
Worth:  200 
Check:  1  heuristic. 


Appendix  2.1.67  Multiple-elements-structure 

Name(s):  Multiple-elements-structure,  Muit-ele-struc,  sometimes:  Lists. 

Definitions:  none 

Specializations:  Li:its,  Bags 

Generalizations:  Structure 

In-domain-of:  nene.^^ 

View:  To  view  <iny  nonmuit-struc  as  a  muit-struc,  do  nothing  to  it, 
or:  copy  some  elements  inside  it  a  random  number  of  times. 
Worth:  200 
Fillin:  1  heuristic. 
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Thar*  tr*  many  tptciil  funcliont  which  can  only  maka  tana*  for  multipla-alaa  atrueturaa,  *(.,  R*inov*-l-oecurr*ne*(x,S>, 
varau*  R*inov*-all-occurr*nc**(x,S).  Such  oparationa  hava  not  yat  baan  codad  and  addad  to  AM. 
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Appendix  2.1.71  Sets 


Nam«(t):  Set,  Class,  Collaciion 
Dafinltionss^* 

Racursivt:  X  (S)  (S«{}  or  SaiDofinition  (Sat'‘D«lat«.AIg(Momb«r.Alg(S),S))) 
Racurtivo  quick:  X  ($)  ($■{}  or  SalDafinition  (CDR(S))) 

Quick:  X  (S)  (Match  S  with  {...} ) 

Intuitions:  nona  at  prciant.^^ 

Spaciatizations:  Sat-o(*structuras^° 

Ganaralizations:  Unordarad-Structura,  No-multipla-alamants-Structura 
in-domain-of:  Sat-union,  $at-intarsact,  $at*diffararica,  Sat-insart,  Sat-daleta 
ln*ranga*of:  Sat-union,  Set-intarsact,  Sat-di((aranea,  Sat-insart,  Sat-dalata 
Viaw:  To  viav/  any  structura  as  a  Sat,  do:  X  (x)  Enclosa-in-bracastx) 

To  viaw  any  pradicata  as  a  Sat,  do:  X  (P)  S**{}. 

Foraii  X  in  Exatnplas(Oomein(P)):  If  P(x)  than  Sat-insart.Alg(x,S). 

Worth:  400 
Sugg:  1  hauristic. 

Intarast:  1  hauristic. 


Appendix  2.1.72  Bags 

Nama(s):  Bag,  somatimas:  Multisat,  lomatimas:  Collaction. 

Dafinitions: 

Racursi«'a:  X  (S)  (S«( )  or  Bag.Dafimtion(Bag-dalata.Alg(Mambar.Alg(S),S))) 
Racursiva  quick:  X  (S)  (S«( )  or  Bag.Do(inition  (C0R(S})) 

Quick:  X  (S)  (Match  S  with  (...) } 

Spacializations:  Bag-of-structuras^^ 

Ganaraiizations:  Unordarad-Structura,  Multipla-alamants-Structura 
Worth:  400 

In-domain-of:  Bag-union,  Bag-intarsact,  Bag-diffaranca,  Bag-insart,  Bag-dalata 
In-ranga-of:  Bag-union,  Bag-intarsact,  Bag-diffaranca,  Bag-insart,  Bag-dalata 
Viaw:  To  viaw  any  structura  as  a  Bag,  do:  X  (x)  Enclosa-in-parans(x) 


4S 

A  eurprieini  Met,  which  fell  out  niturilly  while  d**i|nin|  the  entrie*  for  the  definition  fecete  of  Sett,  Bt|*,  etc.,  i*  thet 
the  difference*  between  th**e  etructuree  i*  not  in  their  definition  eo  much  «•  in  the  pirticultr  operator* 
which  work  on  them.  Thu*  ill  4  kinds  of  etructuree  eppeer  to  hive  eyntictieelly  eimiler  concepts,  even 
incKMini  their  definition*.  The  retder  muet  examine,  *(.,  the  definition  of  Bi|>int*rt  and  S*t>in*art  to 
discover  the  real  difference*  between  the  Set  end  Beg  etructuree  which  AM  know*  about. 

49 

Several  nice  intuition*  were  orig inelly  provided,  then  scrapped  when  ALL  intuition*  were  excised  from  AM. 

50 

This  concept  was  eyntheeixed  by  AM,  but  was  then  left  'permanently'  in  place. 


I.  r  ■  r  •  ' 


Apptndix  2 


AM  Oitceviry  in  Math«mitic*  i$  Heuriitic  Starch 


-213- 


Appendix  2.1.73  Lists 


Nam«(t):  List,  Liit*structur«,  Vtetor,  Tuple,  nMupIc,  Sequence,  Ordered-beg 
Oefinitionit 

Recursive!  X  (S)  ($■<  >  or  List.Definition(List'‘Delete.Alg(Membv.'.AIg(S),S))) 
Recursive  quick:  X  ($)  ($■<  >  or  List.Definition  (C0R($))} 

Quick:  X  (S)  (Match  S  with  <„.>  ) 

Generalizations:  Ordered*$tructure,  Multiple-elements'Structure 
Specializations:  Ordered'pairs 
Worth:  400 

ln>domsin*of:  List-union,  List-intersect,  List-difference,  List-insert,  List-delete.^' 
In-range-of:  List-union,  List-intersect,  List-difference,  List-insert,  List-delete 
View:  To  view  an/  structure  as  a  List,  do:  X  (x)  Enclose-in-angle-bracketstx) 


Appendix  2.1.74  Ordereci-paifs 

Name(s):  Ord-pair,  Opair,  Ordered  pair,  2*tuple,  sometimes:  i/,o  pair,  pair. 

Definitions: 

Declarative:  X  (S)  There  exist  x  and  /  such  that  $><x,y>. 

Nonrecursive  opaque:  LisiDefinition  (S)  and  CDR($)  and  Null(CDDR(S)). 

Nonrecursive  slow:  X  (S)  List.Definition($),  and  S/O,  end  2*'Member.Alg(S}, 
and  S«-Liet-delete.AI|(z,S),  and  $/<>,  and  /^Member .AI|(S), 
and  Liet-delete.Defn(y,$,<». 

Nonrecursive  quick:  X  (S)  (Match  S  with  C-x,**/) ) 

Generalizetions:  Lists 

Worth:  200 

In-domain-of:  Reversa-ord-psir 

In-range-of:  Raverse-ord-pair 

View:  To  view  any  entity  x  as  an  ordered  pair,  consider  the  pair  <x,x>. 

View:  To  view  an  example  of  an  active  concept  F  as  an  ord-pair,  construct  the 
pair  whose  first  element  is  a  list  of  the  arguments  to  F 
[or:  THE  argument  to  F,  if  there  is  only  one],  and  whose 
second  element  is  the  value  of  F  on  those  srg(s). 

View:  To  view  an  (ordered)  structure  S  as  an  Opair,  consider  the  pair  whose 
first  element  is  some  member  of  (the  first  member  of)  S,  and 
whose  second  element  is  all  the  remaining  members  of  S. 

View:  Transform  the  ordered  structure  (a  b...c)  into  the  Opair  (a  b)  or  (a  c). 


Tlwra  tra  many  tpfei«l  function*  which  can  only  mak*  aana*  for  liata,  i . ,  thi*  on*:  '6*lw**n(x,S)'  which  raturn*  a  liat  of 
all  alamant*  lyin|  after  th*  firat  occurrence  of  x  in  S,  but  before  th*  aaeond  accurranc*.  Such  operation* 
nav*  not  yat  been  coded  and  addad  to  AM. 
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Appendix  2.1.75  QsetS 


Namt(t):  Ot«t,  Oi«t>struetur«,  Ordarad-tat,  tomatimat:  Sat. 

Dafinitionss 

Racurciva:  X  (S)  ($■[  ]  or  Oiat.Oafiniiion(Otat*Dalata.Alg(Mambar.Alg(S}|$)}} 
Racursiva  quick:  X  (S)  ($■[ }  or  OsaiDafinition  (CDR(S))) 

Quick:  X  (S)  (Match  S  with  [...] ) 

Ganaralizations:  Ordarad-Structura,  No-multipla*alamantc*Structura 
Worth:  400 

in-dotnain’Of:  Otat-union,  Osat'intarsact,  Osai>di(faranca,  Osat'insart,  Oset*dalata 
In-ranKa-of:  Osat«union,  Osat*intartact,  Osat-diffaranca,  Osat-insart,  Osat-delata 
Viaw:  To  viaw  any  itructura  as  a  Osat,  do:  X  (x)  Enclo8a~in>aquara-brackat8(x) 


Appendix  2.2.  Concepts  never  fully  implemented 

The  following  concepts  were  designed  ’’on  paper"  before  AM  was  coded,  but  were  never  put 
into  AM  -  at  least  not  fully.  Future  work  on  AM  may  Include  their  coding,  insertion  into 
AM.  and  debugging.  An  asterisk  (<•)  means  that  a  crude,  rudimentary  version  of  the 
concept  was  coded  and  placed  in  AM,  but  had  little  impact  on  its  behavior. 


Statamant:;  would  include  conjectures,  theorems,  axioms,  hypotheses,  conclusions, 
relationships. 

Prove,  Disprova,  Proof,  Couniaraxampla,  Thaoram,  Taehniquas  for  proving  oxistonco, 
Tachniquaa  for  astabliahing  univaraal  conjacturaa,...:  altogether  about  two  dozen 
concepts  were  designed. 

Mathamatical  Induction,  including  double  induction. 

Mathamatical  theory,  aystam,  baaia,  foundation,  axiom,  iaomorphiam,... 

-  Cauaa  and  affect:  their  relation  to  theory  formation. 

Variable,  Aaaignmant,  Binding,  Quantification,  Scope,...:  a  dozen  concepts  along  these  lines. 

Constant,  Identifier,  PNAME/P2NAME,...:  AM  never  really  needed  any  non-opaque 
information  about  these,  although  future  expansion  of  the  system  should  probably 
include  the  coding  and  insertion  of  these  concepts. 

Invarae-coalesce:  Given  an  active  concept  F(x),  replace  some  occurrences  of  x  in  F.Defn 
by  "y",  thereby  making  a  new  operation  which  is  a  function  of  x  and  y. 

Nogato,  Conjoin,  Disjoin,  Imply,...;  These  logical  operators  and  relationships  had  too  little 
semantic  information  to  make  it  necessary  to  encode  each  one  into  a  concept. 

(«)  Conatruefiv*,  Deatructiva:  these  two  predicates  would  Judge  any  operation. 
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Appendix  2.3.  Concepts  and  Heuristics  as  coded  in  LISP 

The  reader  may  wish  to  inspect  the  actual  LISP  encoding  of  concepts  and  their  facets  - 
including  heuristic  rules.  For  that  reason,  a  few  pages  are  excerpted  from  the  AM  program 
and  shown  below. 

The  facets  of  a  concept  are  stored  as  properties  on  its  property  list.  Each  facet  has  a  rigid 
format  that  it  must  adhere  to;  that  format  varies  from  facet  to  facet. 

Two  concepts  have  been  selected:  Compose,  which  is  larger  than  the  typical  concept,  and 
0 set-structure,  which  is  a  smaller  and  simpler  concept. 


Appendix  2.3.1.  The  ‘Compose’  Concept 

Here  is  the  property  list  of  the  atom  "COMPOSE",  when  AM  starts  up.  The  reader  should 
look  for  (and  find!)  parallels  between  the  complete  entries  below  and  the  abbreviated 
summaries  on  page  178.  For  that  reason,  after  each  entry,  the  corresponding  summary  line 
is  repeated  (in  a  box). 

(COMPOSE  Compose  Composition  (Afterwards)) 

Appearance  on  page  178: 


Nam«(«):  Compote,  Composition,  sometimes:  afterwards; 


DEFN  (TYPE  NEC&SUFF  PC  DECLARATIVE  SLOW  (FOREACH  X  IN  (DOMAIN  BA2) 

RETURN  (APPLYB“  BAl  ALGS  (APPLYB  BA2  ALGS  X] 

DEFN-SUFF  [[TYPE  SUFFICIENT  NONRECURSIVE  QUICK 
(AND  (ISA  BAl  ’ACTIVE) 

(ISA  BA2  ’ACTIVE) 

(ISA  BAS  ’ACTIVE) 

(ARE-EQUIV  BAS  (ALREADY-COMPOSED®^  BAl  BA2] 

[TYPE  SUFFICIENT  QUASIRECURSIVE  SLOW  (ARE-EQUIV  BAS 

(APPLYB  ’COMPOSE  ’ALGS  BAl  BA2]®® 

[TYPE  SUFFICIENT  QUASIRECURSIVE  QUICK  (EQUAL  BAS 
(APPLYB  ’COMPOSE  ’ALGS  BAl  BA2]] 

Appearance  on  page  178: 


52 

Thit  t$  thorl  for  "Engliih  nami",  and  it  tha  fKtt  callad  "Namata}'*  avarywhara  alta  in  Ihia  thaaia. 

53 

Tha  function  "APPLVB"  indicatai  that  a  concapt'a  facat  ia  to  ba  accatiad  and  than  axacutad  (APPLYB  C  F  x  y...)  meant: 
accatt  an  entry  on  facat  F  of  concept  C,  and  than  run  it  on  tha  argumanta  x,y^.. 

Thia  LISP  function  ehacka  to  tea  whether  tha  two  oparationa  have  bean  eempoaad  before. 

®®  Tha  argumanta  to  Cotnpoaa.Dafn  (and  to  Compote  Alga  at  wall)  are  called  BA  I,  BA2,..  Thua  we  would  write  each 
definition  of  Compote  at  (BAl  BA2  BAS) 
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Oflfiniiions: 

D«elar*tiv«  slowi  X  (A,B,C)  Vx,  C(x)«A(B(x). 

Sufficiant  Nonraeurtiva  Quickt  X  (A,B,C}  C  has  iha  Nama  'AoB’. 
Sufficiant,  Slow:  Ara*aquivalant(C,Compo6a.Alfs(A,B)). 
Sufficiant,  Quick:  CaComposa.Alga(A,B). 


D-R  ((OPERATION  ACTIVE  OPERATION) 

(RELATION  RELATION  RELATION) 

(PREDICATE  ACTIVE  PREDICATE) 

(ACTIVE  ACTIVE  ACTIVE)) 

D-R-FILLINl  (PROGN  (ARGS-ASA  COMPOSE  FI  F2)  (CADAR  (CON-MERGE-ARGS®*  FI  F2))) 
EXS-D-R-FILLINl  [PROGN  (ARGS-ASA  COMPOSE  FI  F2) 

[SETQ  RANI  (LAST  (ANYIOF  (GETB  FI  ’D-R]  (*  RANI  is  the  range  of  FI) 

[SETQ  DOMl  (ALL-BUT-LAST  (ANYIOF  (GETB  FI  ’D-R] 

[SETQ  RAN2  (LAST  (ANYIOF  (GETB  F2  ’D-R]  (•  RAN2  is  the  range  of  F2) 

[SETQ  DOM2  (ALL-BUT-LAST  (ANYIOF  (GETB  F2  ’0-R] 

[SETQ  X  (MAXIMAL  RAN2  DOMl  ’FRAC-OVERLAP] 

(NCONCl  (LSUBST  DOM2  for  X  in  DOMl)  RANI] 


Appearance  on  page  178: 


Domain/ranga:  <Activa  Activa  Aetiva> 

<Oparation  Activa  OparatianX 

<Pradicata  Activa  PradicataX 

<Ralation  Ralation  RalationX 

Fillin:  2  (out  of  a  total  of  9)  hauristics. 

In  Appandix  3,  thaca  ara  hauristics  numbars  175  and  176. 


ALGS  ((TYPE  QUASIRECURSIVE  INDIRECT  CASES  [PROGN 
(COND 

((NULL  BAD 
(APPLYB  ’COMPOSE 
’ALGS 

(RAND-MEMB  (EXS®^  ACTIVE)) 

BA2  BA3  BA4))®* 


56 

Thi*  i*  •  LISP  function,  opiqut  to  AM,  which  inclyxm  tho  Domiin/rinfo  fKOt*  of  tho  two  oporitiont  F I  «nd  F2,  and 
teot  how  (if  ft  ill)  tho  rinfi  of  Fi  cm  bo  midt  to  ovirlip  tho  domain  of  F2.  Not#  that  F2  io  appliod 
AFTER  Fi.  Tho  LISP  coda  for  thia  function  ia  praaanlad  on  pa|a  221. 

®^  Tha  function  *EXS*  ripplaa  outward  from  ita  arfumant,  collactinf  axamplaa  aa  it  |oaa. 

58 

Nota  what  Ihia  clauaa  aaya;  if  (^mpoiaAlfa  ia  avar  callad  with  Ha  firat  ar|umant  miaaini,  randomly  aalact  an  Activa  to 
uaa  aa  that  conatituant  of  tha  eompoaition. 
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&59 

((ALREADY-COMPOSED  BAl  BA2)  (♦  Note:  this  sets  GTEMP12)  GTEMP12) 

((AND  BAl  BA2  (IS-CON®°  BAl) 

(IS-CON  BA2) 

(ISA  BAl  ’ACTIVE) 

(ISA  BA2  ’ACTIVE) 

(SETQ  GTEMPll  (CON-MERGE-ARGS  BAl  BA2  GTEMP12))) 

(*  GTEMP12  is  now  the  name  of  the  new  composition) 

(CREATEB®'  GTEMP12) 

[SETQ  GUPl  (COND  ((ISAG  CS-B  ’COMPOSE)  CS-B)  (T  ’COMPOSE] 

(*  GUPl  is  now  the  KIND  of  concept  which  GTEMP12  is  to  be  an  example  of. 
This  will  usually  be  "COMPOSE"  or  some  variant  of  it. ) 

tlNCRB®^  GTEMP12  ’DEFN 
(LIST  ’TYPE  ’APPLICATION  ’OP  GUPl 

(APPEND  (LIST  ’APPLYB  (Q*®  COMPOSE)  (Q  ALGS)  (KWOTE  BAl)  (KWOTE  BA2)) 
(FIRSTN  (LENGTH  (CAAR  GTEMPll))  BA-LIST] 

(*  Another  way  to  fill  in  an  entry  for  GTEMPi2.Defn) 

(COND 

([SETQ  GTEMP308  (CAR  (SOME  (EXS  COMPOSE) 

(FUNCTION  (LAMBDA  (C) 

(MEMBER  (LASTELE  (GETB  GTEMP12  ’DEFN)) 

(GETS  (LASTELE  C)  ’DEFN] 

(FORGET-CONCEPT  GTEMP12) 

(CPRINIS  8  GTEMP12  turned  out  to  be  equivalent  to  GTEMP308  DCR)®'* 
GTEMP308) 

(T  (INCRB  GUPl  ’EXS  (NCONCl  (GEARGS  GUPl)  GTEMP12)) 

[SOME  (RIPPLE  GUPl  ’GENL) 

(FUNCTION  (LAMBDA  (G) 

(SOME  (GETB  G  ’D-R) 

(FUNCTION  (LAMBDA  (D) 

(AND  (ISA  BAl  (CAR  D)) 

(ISA  BA2  (CADR  D)) 

(INCRB  GTEMP12  ’UP*®  (CADDR  D)) 

(INCRB  (CADDR  D)  ’EXS  GTEMP12] 

(a  This  last  INCRB  says  that  if  an  operation  f  maps  onto  range  C, 
and  we  apply  f  and  get  a  new  Being,  then  that  Being  ISA  C)** 

(INCRB  GTEMP12  ’IN-RAN-OF  GUPl) 

(INCRB  BA2  ’IN-DOM-OF  GUPl) 


CQ 

Similar  to  laat  cast,  takaa  cara  of  mittinf  aacond  ar|umant.  Tha  amparaand,  "A”,  indieataa  an  omiaaion  from  thia  liating. 

An  abbraviafion  for  (APPLYB  'ANY-CONCEPT  'DEFN  BAD;  ia.,  taat  whalhar  BAl  ia  a  bona  fida  eoneapt  or  not, 

**  CREATES  ia  a  function  which  aata  up  a  naw  blank  data  atructura  for  a  naw  concapt. 

Tha  function  call  (INCRB  C  F  X)  maana:  add  antry  X  to  tha  F  facat  of  concapt  C. 

63 

Tha  LISP  function  ‘Q’  ia  lika  a  doubla  quota:  aftar  ona  avaluation  {Q  X)  raturna  'X;  aftar  ona  mora  avaluation,  'X  raturna 
X:  aftar  a  final  avaluation,  wo  gat  tha  VALUE  of  X. 

64 

A  conditional  print  atatamant.  If  tha  varboaity  laval  ia  high  anough  (>8),  thia  maaaaga  ia  lypad  out  to  tha  uaar.  Nota  tha 
intormixing  of  variablaa  (ag.,  ’GTEMPSOS")  and  undafinad  atoma  (a^.,  "aquivalant'').  CPRINIS  axaminaa 
aach  argumant,  and  if  it  ia  undafinad,  it  quotaa  it. 

*®  Tha  ISA’a  fKot  ia  callad  "UP"  in  tha  LISP  program. 

**  Thia  ia  a  atraamlinad,  apacializad  varaion  of  tha  mora  ganaral  hauriatic  rula  numbar  154;  aaa  paga  259. 


-  _ -  .■  a  ~  - * - * - 
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(INCRB  BAl  ’IN-DOM-OF  GUPl) 

{*  Now  see  if  the  composition  GTEMP12  shares  any  ISA’s  entries  with 
either  constituent  operation:  BAl  or  BA2)®^ 

[MAPC  [INTERSECTION  (SET-OIFF  [UNION  (GETB  BAl  ’UP)  (GETB  BA2  ’UP] 

(GETB  GTEMP12  ’UP] 

(FUNCTION  (LAMBDA  (2) 

(COND 

((DEFN  2  GTEMP12) 

(INCRB  2  ’EXS  GTEMP12) 

(INCRB  GTEMP12  ’UP  2] 

(COND 

[(GETB  GTEMP12  ’UP) 

(SETB  GTEMP12  ’GUP  (COPY  (GETB  GTEMP12  ’UP] 

(T  (INCRB  GTEMP12  ’UP  ’OPERATION) 

(INCRB  ’OPERATION  ’EXS  GTEMP12))) 

&  (*  A  similar  search  now  for  GENL/SPEC  of  the  composition) 

(SETB  GTEMP12  ’D-R  (CAR  GTEMPl))) 

(INCRB  GTEMPl  2  ’ALGS 

(LIST  ’TYPE  ’NONRECURSIVE  ’APPLICATION  ’OF  GUPl  (CADR  GTEMPl  1») 

&  («  Code  for  synthesizing  a  Defn  entry  for  GTEMPl 2) 

(SETB  GTEMPl  2  ’WORTH 

(MAP2CAR  (GETB  BAl  ’WORTH)  (GETB  BA2  ’WORTH)  ’TIMESIOOO)) 
(GS-CHECK**  GTEMPl 2]]))] 

Appearance  on  page  178: 


Algorithms: 

Distributed:  use  the  heuristics  attached  to  Compose  to  guide  the  filling 
in  of  various  facets  of  the  new  composition. 

(The  heuristics  referred  to  are  shown  in  Appendix  3.6,  on  page  263.) 
Fillin:  5  (out  of  a  total  of  9)  heuristics.  , 

Check:  1  heuristic  (out  of  a  total  of  2) 


UP  (OPERATION) 

Appearance  on  page  178: 
Isa's:  Operation 


Thi*  n«xt  MAPC  j«  thu*  1h«  LISP  •ncoding  of  haurnlic  rule  numbtr  177;  pt|t  263. 

6S 

Thi»  it  $  |tn«ril-purpott  function  for  taotinf  thit  thoro  it  no  hkfdtn  eyelt  in  tht  Gtnortlicttion  notwork,  that  no  two 
concaptt  trt  both  itnartliittiont  tnd  tptcitlixttiont  of  tKh  othor,  unlatt  thoy  art  tt||ad  at  bainf 
aquivalant  to  tKh  othar. 
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WORTH  (300) 
Appearance  on  page  I7S: 
Worth!  300 


INT®®  [(IMATRIX  (1  2  3)  (4  5)) 

(COND  [(INTERSECTION  (MAPAPPEND  (GETB  BA2  ’D-R)  ’LAST) 

(MAPAPPEND  (GETB  BAl  ’D-R)  ’ALL-BUT-LAST)) 

300 

(IDIFF  400  (ITIMES  100  (IPLUS  (LENGTH  (GETB  BAl  ’D-R)) 

(LENGTH  (GETB  BA2  ’D-R] 

(REASON  (♦  In  some  interpretation,  Range-of-op2  is  1  component  of  Domain-of-opl))) 
(COND  [[MEMB  [CAR  (LAST  (CAR  (GETB  BA2  ’D-R] 

(ALL-BUT-LAST  (CAR  (GETB  BAl  ’D-R] 

400 

(IDIFF  1000  (ITIMES  100  (LENGTH  (CAR  (GETB  BAl  ’D-R] 

(REASON  (*  In  canonical  interpretation,  Range-of-op2  is  a  component  of  Domain  of  opl))) 
(COND  [(INTERSECTION  (GETB  CS-B  TIES) 

(UNION  (GETB  BAl  TIESKGETB  BA2  TIES))) 

100 

(ITIMES  100  [LENGTH  (INTERSECTION  (GETB  CS-B  TIES) 

(UNION  (GETB  BAl  TIESKGETB  BA2  TIES]) 

(REASON  (♦  This  composition  preserves  some  good  properties  of  its  constituents))]) 
(COND  [(SET-DIFFERENCE  (GETB  CS-B  TIES) 

(UNION  (GETB  BAl  TIESKGETB  BA2  TIES))) 

100 

(ITIMES  100  [LENGTH  (SET-DIFFERENCE  (GETB  CS-B  TIES) 

(UNION  (GETB  BAl  TIESKGETB  BA2  TIES]) 

(REASON  (♦  This  composition  has  some  new  props,  not  true  of  either  constituent))]) 
(COND  [(OR  (GREATER?  (GETB  BAl  ’WORTH)  500)) 

(GREATER?  (GETB  BA2  ’WORTH)  500))) 

300 

(IQUOTIENT  (ITIMES  (GETB  BAl  ’WORTHKGETB  BA2  ’WORTH)) 

1000) 

(REASON  (*  Opl  and/or  Op2  are  very  interesting  themselves))]) 

(COND  [[IS-ONE-OF  [CAR  (LAST  (CAR  (GETB  BA2  ’D-R] 

(ALL-BUT-LAST  (CAR  (GETB  BAl  ’D-R] 

350 

(IDIFF  [ITIMES  100  (IDIFF 

[LENGTH  (CAR  (GETB  BAl  ’D-R] 

(LENGTH  (RIPPLE  [IS-ONE-OF 

[SETQ  TMP4  (CAR  (LAST  (GETB  BA2  ’D-R] 

(ALL-BUT-LAST  (CAR  (GETB  BAl  ’D-R] 

’GENL] 

(ITIMES  50  (LENGTH  (RIPPLE  TMP4  ’GENL] 

(REASON  (*  In  canonical  interpretation,  Range-of-op2  is  a  specialization  of  a  component 
of  Domain-of-opl))) 


69 


Not*  that  although  th«  Fillir,  and  Suggaat  hauriatiea  ara  blandad  into  tha  ralavant  facata  (a.g.,  into  tha  Algorithma  for 
COMPOSE),  tha  INTERESTINGNESS  typa  hauriatiea  ara  kapt  aaparata,  in  thia  facat. 
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(COND  [[MEMB  [CAR  (LAST  (CAR  (GETS  BAl  ’D-R] 

(ALL-BUT*LAST  (CAR  (GETB  BA2  ’D*R] 

450  ; 

(IPLUS  300  (COND  ([MEMB  [CAR  (LAST  (CAR  (GETB  BAl  ’D-R] 

(ALL-BUT-LAST  (CAR  (GETB  BAl  ’D-R] 

10) 

(T  250)) 

(COND  ([MEMB  [CAR  (LAST  (CAR  (GETB  BA2  ’D-R] 

(ALL-BUT-LAST  (CAR  (GETB  BA2  ’D-R] 

11) 

(T  250)) 

(ITIMES  70  (LENGTH  (RIPPLE  [CAR  (LAST  (CAR  (GETB  BAl  ’D-R]  ’GENL] 
(REASON  (*  In  canonical  interpretation, 

Range-of-opl  is  one  component  of  Domain-of-op2)) 

& 

(COND  [[ISA  [CAR  (LAST  (CAR  (GETB  BAl  ’D-R] 

(ALL-BUT-LAST  (CAR  (GETB  BA2  ’D-R] 

250 

(IPLUS  50  (COND  ([ISA  [CAR  (LAST  (CAR  (GETB  BAl  ’D-R] 

(ALL-BUT-LAST  (CAR  (GETB  BAl  ’D-R] 

10) 

(T  100)) 

(COND  ([ISA  [CAR  (LAST  (CAR  (GETB  BA2  ’D-R] 

(ALL-BUT-LAST  (CAR  (GETB  BA2  ’D-R] 

11) 

(T  100)) 

(ITIMES  50  (LENGTH  (RIPPLE  [CAR  (LAST  (CAR  (GETB  BAl  ’D-R]  ’GENL] 
(REASON  (a  Range-of-opl  is  a  specialization  of  a  component  of  Domain-of-op2] 

Appmanet  on  pagt  178;  I 


Interest:  1 1  heurittiei. 

The  heuristic  rules  encoded  above  are  shown  in  English  on  page  265. 


//era  is  the  code  for  CON-ME'RGE-ARGS,  the  function  which  decides  how  to  overlap 
the  domainirange  facets  of  its  two  arguments,  FI  and  F2: 


<CON.MERGE-ARGS 

(LAMBDA  (FI  F2  FI2  PGMI  SCHK  ftAPL  OOMl  D0M2  RANI  RAN2  TIL  OOM3) 
(SETQ  RANI  (LAST  (CAR  (GETB  FI  ’D-R] 

(SETO  DOM  I  (LDIFF  (CAR  (GETB  FI  ’D-R)) 

RAND) 

(SETQ  RAN2  (LAST  (CAR  (GETB  F2  ’D-R] 

(SETO  D0M2  (LDIFF  (CAR  (GETB  F2  ’D-R» 

RAN2)} 

(SETO  00M3  (AND  (CDR  OOMl) 

(LIST  (CAOR  (MIN2  (APPEND  RAN2  RAN2  RAN2 
RAN2)  DOM  I  ’FRAC-OVERLAP] 

(•  Aa  DOMi  and  RANi  ara  locatad,  Switchinf  of  Arfa  may  ba  raquirad,  maida  PGMI) 
(AND  (MEMB  (CAR  00M3)  OOM2)  (SETQ  OOM3  NIL)) 
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(SETQ  GTEMP20  (LENGTH  D0M2)) 

[SETO  SAPL  (NCONC  (LIST  *APPLYB  (KWOTE  FI)  (0  ALGS)) 

(MAPCAR  (SUB-ONCE  ’X 

[SETQ  GT£MPie(COND 

(dS-ONE-OF  (CAR  RAN2)  00M1)> 

[{SETQ  SCHK  (ONE-ISAG  DOM  I  (CAR  RAN2} 

((SETQ  SCHK  (AND  (SETQ  TIL  (EXS  (CAR  RAN2)) 

(CAR  (SOME  DOM  I  (FUNCTION  (LAMBDA  (D) 

(INTERSECTION 

TIL 

(EXS  D] 

DOMI) 

(FUNCTION  (LAMBDA  (2) 

(COND 
((EQ  Z  ’X) 

■X) 

(T  (SETQ  GTEMP20  (AOOl  6TEMP20)) 

(CAR  (FNTH  BA-LIST  GTEMP20) 

(•  SCHK  ii  I  Hag  whkii  maant  that  (2  mapt  ua  into  an  alanant  of  RAN2  <’/hich  ia  not  guarantaad 
a  priori  to  ba  an  alamant  of  DOM  I,  hanea  a  ehack  for  thia  applicability  *f  ft  will  than  hava  to  ba  matla) 
(COND 

//rijpijR  *y  QAPi  \ 

(SETQ  D0M3  (REM-ONCE  GTEMPI9  DOMD) 

(SETQ  GTEMP7  (APPEND  D0M3  DOM2)> 

[COND 

[(NEQ  (LENGTH  GTEMP7) 

(LENGTH  (SELF-INT  GTEMP7))) 

(CPRINIS  9  CRLF  CRLF  AM  can  latar  coalaaca  tha  D-R  of  FI2  DCR) 

(ADD-CANDS  (LIST  (LIST  (LIST  ’APPLYB  (0  COALESCE)  (Q  ALGS)  (KWOTE  Ft 2)) 

(IPLUS  100  (iOUO  (DOTPROO  (FIRSTN  2  (GETS  Ft  ’WORTH)) 

(GETS  F2  ’WORTH))  2000)) 

(LIST  (SPLIST  Thara  ia  an  ovarlap  in  tha  naw  eombinad 
domain  of  tha  oparation  FI 2] 

(SWHY  9  (Thara  ia  an  obvioua  ovarlap  in  (•  GTEMP7),tha  naw  eombinad  domain  of  (ft  FI 2] 

TAe  next  piece  of  this  function  is  the  heuristic  rule  numbered  186  in  Appendix  S. 

([SOME  GTEMP7  (FUNCTION  (LAMBDA  (X) 

(IS-ONE-OF  X  (CDR  (FMEMB  X  GTEMP7) 

(CPRINIS  10  CRLF  CRLF  AM  may  latar  coalaaca  tha  D-R  of  Ft2  DCR) 

[ADD-CANDS  (LIST  (LIST  (LIST  'APPLYB  (0  COALESCE)  (Q  ALGS)  (KWOTE  FI 2)) 

(IQUO  (DOTPROD  (FIRSTN  2  (GETB  FI  ’WORTH)) 

(GETB  F2 ’WORTH))  2500)) 

(LIST  (SPLIST  Thara  may  ba  an  ovarlap 

in  tha  naw  combined  domain  of  tha  operation  FI 2] 

(SWHY  to  (There  ia  a  aubtle  overlap  in  (ft  GTEMP7),tha  new  combined  domain  of  (ft  FI 2] 

(SETQ  PGM  I  (LIST  'PROG 
(LIST  ’X) 

[LIST  ’SETQ  ’X 

(NCONC  (LIST  ’APPLYB  (KWOTE  F2)  (0  ALGS)) 

(FIRSTN  (LENGTH  D0M2)  (LIST  ’BA  I  ’BA2  ’BA3) 

(LIST  ’RETURN 
(COND 

(SCHK  (LIST  ’AND 

(LIST  ’APPLY.  (0  OEFN)  (KWOTE  SCHK)  ’X) 

SAPD) 

(T  (LIST  'AND  ’X  SAPL) 

(LIST  (LIST  (APPEND  D0M2  D0M3  RAND)  PGMt)) 

(T  (.  Compoeing  ia  not  poaaibla)  NIL]) 
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Appandiy  2.3.2.  The  '0»et$*  Concept 

Here  is  the  actual  property  list  of  the  data-structure  corresponding  to  the  Osets  concept: 


ENGN  (QSET  Oset  Oset-structura  OSET-STRUC,  Ordarad-sat  (Sat)) 

DEFN  (TYPE  NEC&SUFF  RECURSIVE  TRANSPARENT  [COND 
((EQUAL  BAl  (OSET  ))  T) 

(T  (APPLYB  ’OSET  ’DEFN  (APPLYB  ’OSET-DELETE  ’ALGS 

(APPLYB  ’SOME-MEMB  ’ALGS  BAl) 

BAl]) 

(TYPE  NEC&SUFF  RECURSIVE  QUICK  [COND 
((EQUAL  BAl  ’(OSET  ))  T) 

((CDDR  BAl)  (APPLYB  ’OSET  ’DEFN  (RPLACD  BAl  (CDDR  BAl))) 

(T  NIL]) 

(TYPE  NEC4SUFF  NONRECURSIVE  QUICK  (MATCH  BAl  WITH  (’OSET  f))) 
GENL  (ORD-STRUC  NO-MULT-ELES-STRUC) 

WORTH  (400) 

IN-DOM-OF  (OSET-JOIN  OSET-INTERSECT  OSET-DIFF  OSET-INSERT  OSET-DELETE) 
IN-RAN-QF  (OSET-JOIN  OSET-INTERSECT  OSET-DIFF  OSET-INSERT  OSET-DELETE) 
VIEW  (STRUCTURE  (RPLACA  BAl  ’OSET)) 


Compare  this  with  the  way  that  the  "Osets”  concept  appeared,  on  page  214  of  Appendix 

2.1: 


Nama(t)s  Otat,  Oeat’etructura,  Ordarad-aat,  aomatimait  Sat. 

Oafinitionit 

Raeuraivat  \  ($)  ($■[  ]  or  Oiat.Oafin)tion(Oiat-Dalata.Alg(MambarAI|($),$))> 
Raeurtiva  quick:  X  ($)  ($•[  ]  or  Oaat.Dafinition  (CDR(S))) 

Quick:  X  ($)  (Match  S  with  [...] ) 

Ganaraiisationa:  Ordarad-Struetura,  No>multipla'’aiamanta'$tructura 
Worth:  400 

In-domain-of:  Oaat'union,  Oaat*intaraact,  Oaat-diffaranca,  Oaat*inaart,  Oaat'dalata 
ln-r8nKa*af:  Oaat'union,  Oaat*intaraact,  Oaat-dIffaranca,  Oaot>insort,  Oaat-dalata 
Viaw:  To  viaw  any  atructura  aa  a  Oaat,  do:  X  (x)  Encloao-in-aquara-brackatafy) 
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The  list  below  is  meant  to  suggest  the  range  of  AM’s  definitions;  it  is  far  from  complete,  and 
most  of  the  omissions  were  real  losers.  The  concepts  are  listed  in  the  order  in  which  they 
were  defined.^®  In  place  of  the  (usually-awkward)  name  chosen  by  AM,  I  have  given  either 
the  standard  math/English  name  for  the  concept,  or  else  a  short  description  of  what  it  is. 

Sets  with  less  than  2  elements  (singletons  and  empty  sets). 

Sets  with  no  atomic  elements  (nests  of  braces). 

Singleton  sets. 

Bags  containing  (multiple  occurrences  of)  just  one  kind  of  element. 

Superset  (contains). 

Doubleton  bags  and  sets. 

Set-membership. 

Disjoint  bags. 

Subset. 

Disjoint  sets. 

Singleton  osets. 

Same-length  (same  number  of  elements). 

Same  number  of  left  parentheses,  plus  identical  leftmost  atoms. 

Count  (find  the  number  of  elements  of  a  given  structure). 

Numbers  (unary  representation). 

Add. 

Minimum. 

SUBl  (X  (x)x-l). 

Insert  x  into  a  given  Bag-of-T's  (almost  ADD  I,  but  not  quite). 

Subtract  (except:  if  x<y,  then  the  result  of  x-y  will  be  zero^'). 

Less  than  or  equal  to. 

Times. 

Union  of  a  bag  of  structures. 

&  (the  ampersand  represents  the  creation  of  several  real  losers.) 

Compose  a  given  operation  F  with  itself  (form  FoF). 

Insert  structure  S  into  itself. 

Try  to  delete  structure  S  from  itself  (a  loser). 

Double  (add  ’x’  to  itself). 

Subtract  ’x’  from  itself  (as  an  operation,  this  is  a  real  zero^^). 

Square  (TIMES(x,x)). 

Union  structure  S  with  itself. 

Coalesced-replace2:  replace  each  element  s  of  S  by  F(s,s). 

Coalesced-join2:  append  together  F(s,s),  for  each  member  s<S. 

Coa-repeat2:  create  a  new  op  which  takes  a  struc  S,  op  F,  and  repeats  F(s,t,S)  all  along  S. 
Compose  three  operations:  X(F,G,H)  Fo(GoH). 

Compose  three  operations:  X(F,G,H)  (FoG)oH. 

S*t  Apptndix  5.2,  p.  294,  for  t  dtliiltd  Irict  of  how  thtto  eoncopt*  wort  ditcovtrtd.  Or  «m  Stetion  6.1,  p.  US, 
for  •  briofor  vortion  of  tho  tamo  dovolopmont. 

^  *  Thit  it  ’nalural-numbtr  tubtricf,  in  Iht  tamt  tpirit  of  namini  at  wt  find  for  ’Inttftr  divition*. 

72 

»  naturil  ztro? 
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&  (lets  of  losing  compositions  c^r  ited,  e.g.  Self-insertoSet-union.) 

ADD*'(x):  all  ways  of  representing  x  as  the  sum  of  a  bunch  of  nonzero  numbers. 
CoH,  s.t.  H(G(H(x}))  is  always  defined  (wherever  H  is),  and  G  and  H  are  interesting. 
InsertoDelete. 

Deleteolnsert. 

SizeoADD*'.  {X  (n)  The  number  of  ways  to  partition  n) 

Cubing 

Sc 

Exponentiation. 

Halving  (in  natual  numbers  only;  thus  Halving(15)-7). 

Even  numbers. 

Integer  square-root. 

Perfect  squares. 

Divisors-of. 

Numbers-with-O-divisors. 

Numbers-with- 1-divisor. 

Primes  (Numbers-with-2-divisors). 

Squares  of  primes  (Numbers-with-S-divisors). 

Squares  of  squares  of  primes. 

Square-roots  of  primes  (a  loser). 

TIMES*'(x):  all  ways  of  representing  x  as  the  product  of  a  bunch  of  numbers  (>1). 
All  ways  of  representing  x  as  the  product  of  Just  one  number  (a  trivial  notion). 

Ail  ways  of  representing  x  as  the  product  of  primes. 

All  ways  of  representing  x  as  the  sum  of  primes. 

Ail  ways  of  representing  x  as  the  sum  of  two  primes. 

Numbers  uniquely  representable  as  the  sum  of  two  primes. 

Products  of  squares. 

Multiplication  by  1. 

Multiplication  by  0. 

Multiplication  by  2. 

Addition  of  0. 

Addition  of  1. 

Addition  of  2. 

Product  of  even  numbers. 

Sum  of  squares. 

Sum  of  even  numbers. 

&;  (losers:  various  compositions  of  3  operations.) 

Pairs  of  perfect  squares  whose  sum  is  also  a  perfect  square  (x^^y^-z^). 

Prime  pairs  (p^«2  are  prime). 
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Appendix  3.  AM’s  Heuristics 


Infallible  rules  of  discovery  leading  to  the  solution  of  all  possible  mathematical 
problems  would  be  more  desirable  than  the  philosophers'  stone,  vainly  sought  by 
the  alchemists.  Such  rules  would  work  magic;  but  there  is  no  such  thing  as  magic. 
To  find  unfailing  rules  applicable  to  all  sorts  of  problems  is  an  old  philosophical 
dream;  but  this  dream  will  never  be  more  than  a  dream. 

—  Poly* 


To  the  extent  that  a  professor  of  music  at  a  conservatoire  can  assist  his  students 
in  becoming  familiar  with  the  patterns  of  harmony  and  rhythm,  and  with  how  they 
combine,  it  must  be  possible  to  assist  students  in  becoming  sensitive  to  patterns  of 
reasoning  and  how  they  combine.  T he  analogy  is  not  far-fetched  at  all 


—  Dijk»tr* 


This  appendix  lists  all  the  heuristics  with  which  AM  is  initially  provided.  They  are 
organized  by  concept,  most  general  concepts  first.  Within  a  concept,  they  are  organized  into 
four  groups; 

•  Fillin:  rules  for  filling  in  new  entries  on  various  facets. 

•  Check:  rules  for  patching  up  existing  entries  on  various  facets. 

•  Suggest;  rules  which  propose  new  tasks  to  break  AM  out  of  stagnant  loops. 

•  Interest;  criteria  for  estimating  the  interestingness  of  various  entities. 

Each  heuristic  is  presented  in  English  translation.  Whenever  there  is  a  very  tricky,  non- 
obvious,  or  brilliant  translation  of  some  English  clause  into  LISP,  a  brief  note  will  follow 
about  how  that  is  coded.  Also  given  (usually)  are  some  example(s)  of  its  use,  and  its  overall 
importance.  Concepts  which  have  no  heuristics  are  not  present  in  this  appendix. 

Hundreds  of  heuristics  were  planned  on  paper  but  never  coded  (e.g.,  those  dealing  with 
proof  techniques,  those  dealing  with  the  drives  and  rewards  of  generalized  message 
senders/receivers),  and  whole  classes  of  rules  were  coded  but  never  used  by  AM  during  any 
of  its  runs  (e.g.,  how  to  deal  with  contradictions,  how  to  deal  with  Intu’s  facets).  Such 
superfluous  rules  will  not  be  Included  here.  They  would  raise  the  total  number  of  heuristic 
rules  from  242  to  about  500. 

The  rule  numbering  in  this  Appendix  is  referred  to  occasionally  in  other  appendices.  The 
total  number  of  rules  coded  in  AM  is  actually  higher,  since  many  rules  are  present  but 
never  used,  and  since  many  rules  listed  with  one  number  here  are  really  several  rules  in 
LISP  (e.g.,  see  rules  8?  and  129). 
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It  would  be  advantageous  to  have  a  cross-indexing  of  the  body  of  heuristics  along  several 
dimensions  (a  multiple  sorting  by  a  small  set  of  key  parameters):  sorted  by  interest,  by 
relevance  (the  current  arrangement),  by  cost,  by  payoff,  by  frequency  of  usage,  etc.  This  is 
left  as  a  starred  excercise  for  the  Interested  reader. 


Appendix  3 


AM  Ditcovery  in  Mathemilict  ««  Heurittic  Seerch 


-228- 


AII  these  rules  deal  with  any  item  X,  be  it  concept,  atom,  event,  etc.  These  rules  are  about 
as  general  -  and  as  weak  -  as  one  can  imagine. 


Anything  .  Suggest 

1 .  If  AM  has  racently  rafarenced  antity  X, 

.  Than  boost  tha  priority  of  any  tasks  involving  X. 


2.  If  tha  usar  has  racantly  rafarrad  to  X, 

Than  boost  tha  priority  of  any  tasks  involving  X. 

The  above  two  rules  simply  reaffirm  the  idea  of  "focus  of  attention".  The  boost  in  ratings  is 
only  slight,  and  only  temporary  (it  decays  toward  zero  exponentially  with  time).  Besides  this 
gradual  decline  in  ta.ck  ratings,  the  rule  below  explicitly  modulates  this  boosting,  so  that 
infinite  loops  can  be  ?.  jided. 

3.  If  AM  has  racantly  daalt  with  X  with  poor  rasults, 

Than  lowar  tha  priority  rating  of  all  tasks  involving  X. 


A.  If  AM  just  rafarancad  X  and  almost  suecaadad,  but  not  quita, 

Than  look  for  a  vary  similar  aniity  Y,  and  ratry  tha  activity  with  Y  in  plaea  of  X. 

There  is  a  separate  precise  meaning  for  "almost  succeed",  "similar  entity",  and  "retry"  for 
each  kind  of  entity  and  activity  that  might  be  involved.  For  example,  if  the  activity  were  a 
task  (say  to  fill  In  examples  of  Odd-primes)  and  the  entity  X  were  a  concept  (in  this  case. 
Odd-primes),  then  a  'similar  entity’  might  be  the  concept  Odd-numbers,  and  in  that  case  the 
result  of  this  rule  would  be  a  new  task  (to  fill  in  examples  of  Odd-numbers).  If  the  failure 
occurred  while  AM  was  trying  to  access  the  examples  facet  of  Primes,  with  X«Examples, 
then  a  'similar  entity’  might  be  the  Boundary-examples  facet,  and  the  above  rule  would 
suggest  that  AM  access  instead  the  Boundary-examples  facet  of  Primes.  Of  course,  this  rule 
is  so  weak  that  it  is  not  often  of  much  help. 

5.  If  space  is  running  out,  and  AM  has  not  referenced  X  for  a  long  time,  and  X  is  taking  up  a  lot  of 
space,  and  no  important  conjectures  reference  X, 

Then  X  may  be  forgotten  and  its  space  liberated.  Probably  the  user  should  be  informed  of  this, 
at  least  tersely. 

Just  a  general-purpose  directive  for  emergency  garbage-collection. 
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Anything .  Interest 

6.  Any  entity  X  it  interetting  if  it  it  referre-.  to  in  tovorai  inUratting  eonjocturot. 


7.  Any  entity  X  it  interetting  if  it  it  related  (via  a  rare,  interetting  relation)  to  another  entity 

which  aroae  in  a  very  different  way  and  it  not  obvioutly  tied  to  X. 

Unexpected  connections  are  worth  closer  examination,  typically.  X  might  be  ‘related  to’  Y 
because  F(X)-Y  (for  some  very  interesting  operation  F),  because  Y(X)  is  true  (for  some 
rarely-satisfied  predicate  Y),  because  some  conjecture  involving  X  is  syntactically  identical  to 
the  same  conjecture  involving  Y,  etc. 

8.  Entity  X  it  (tentatively)  interesting  if  there  is  an  analogy  in  which  X  corresponds  to  Y,  and  Y 

has  turned  eut  to  be  very  interesting. 


9.  If  entity  X  is  an  example  of  concept  C,  and  X  satiifies  some  features  on  C.lnt, 

Then  X  is  interesting,  and  C’a  Interestingneis  features  will  indicate  a  numeric  rating  for  X. 

This  is  practically  the  definiton  of  the  Int  facet.  Below  is  a  much  more  ususual  rule: 

10.  If  entity  X  is  an  example  of  concept  C,  and  X  satisfies  absolutely  none  of  the  features  on  C.lnt, 

and  X  is  just  about  the  only  C  which  doesn’t  satisfy  something. 

Then  X  is  interesting  because  of  its  unusual  boringness. 

Since  most  singletons  are  interesting  because  gll  pairs  of  their  elemenu  are  Equal,  the  above 
rule  says  it  would  be  interesting  actually  to  find  a  singleton  for  which  not  all  pairs  of  its 
members  were  equal.  While  it  would  be  interesting,  AM  has  very  little  chance  of  finding 
such  a  critter. 
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Appendix  3.2.  Heuristics  for  dealing  with  Anv-concept 

This  concept  has  a  huge  number  of  heuristics.  For  that  reason,  I  have  partitioned  off  — 
both  here  and  in  AM  itself’  -  the  heuristics  which  apply  to  each  kind  of  facet. 


Appendix  3.2.1.  Heuristics  for  any  facet  of  Any-concept 

The  first  set  of  heuristics  we’ll  look  at  are  very  general,  applying  to  no  particular  facet 
exactly. 


Any-concept .  Fillin 


II.  Whan  frying  to  fill  in  fecat  F  of  coneopt  C,  for  any  C  and  F, 

If  C  is  analogous  to  concept  X,  and  X.F  has  somo  onirics, 

Then  try  to  construct  tha  analogs  of  thos*  ontrias,  and  tea  if  they  ara  really  valid  antrias  for 


C.F. 


Recall  that  "C.F"  is  shorthand  for  "facet  F  of  concept  C".  This  rule  simply  says  that  if  an 
analogy  exists  between  two  concepts  C  and  X,  then  it  may  be  strong  enough  to  map  entries 
on  X.F  into  entries  for  C.F.  Note  that  F  can  be  any  given  facet.  There  is  an  analogy 
between  Sets  and  Bags,  and  AM  uses  the  above  rule  to  turn  the  extreme  example  of  Sets  - 
the  empty  set  -  into  the  extreme  kind  of  bag. 


Any-concept .  Suggest 

1 2.  If  tha  F  facat  of  concapt  X  is  blank, 

Than  considar  trying  to  fill  it  in. 

The  above  super-weak  rule  will  result  in  a  new  task  being  added  to  the  agenda,  for  every 
blank  facet  of  every  concept.  It  is  more  of  a  legal  move  generator  than  a  plausible  move 
proposer.  The  rating  of  each  such  task  will  depend  on  the  Worth  of  the  concept  X  and  the 
overall  worth  of  the  type  F  facet,  but  In  all  cases  will  be  very  small.  The  "emptiness"  of  a 
facet  is  always  a  valid  reason  for  trying  to  fill  it  in,  but  never  an  a  priori  important  reason. 
So  the  net  effect  of  the  rule  is  to  slightly  bias  AM  toward  working  on  blank  -  rather  than 
non-blank  -  fai'ets. 

13.  While  trying  to  fill  in  facet  F  of  concept  C,  for  any  C  and  F,  if  C  is  Known  to  be  similar  to  some 

other  concept  D,  except  for  difference  d, 

Then  try  to  fill  in  C.F  by  selecting  items  from  D.F  for  which  d  is  nonexistent. 

This  rule  is  made  more  specific  when  F  is  actually  known,  and  hence  the  format  of  d  is 
actually  determined.  For  example,  if  C-Rever$e-at-all-levels,  F-examples,  then  (at  one 
particular  moment)  a  note  is  found  on  the  Conjees  facet  of  concept  C  which  says  that  C  is 


I 


Thu*  th«  LISP  profram  h«*  «  «*parit*  concept  C(ll*d  "Extmplet-of-any-concapt",  tnolher  concept  celled  "O*finition*-ef- 
eny-concept",  etc. 
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just  like  the  concept  D^Reverze-top-levei,  except  C  also  recurs  on  the  nonatomic  elements  of 
its  arguments,  whereas  D  doesn't.  Thus  d  is  made  null  by  choosing  examples  of  D  for 
which  there  are  no  nonatomic  elenwnts.  So  an  example  like  'Reverse-top-lev9l(<a  b  c>)-<c  b 
a>’  will  be  selected  and  will  lead  to  the  proposed  example  'Revers9-at-all-l8vels(<a  b  c>)-<c 
b  a^’,  which  is  in  fact  valid. 

i  4.  After  dealing  with  concapt  C, 

Slightly,  temporarily  boost  the  priority  value  of  each  existing  task  which  involves  an  Active 
concept  whoso  domain  or  range  Is  C. 

This  is  done  efficiently  using  the  In-dom-of  and  In-ran-of  facets  of  C.  A  typical  usage  was 
after  checking  the  just-filled-in  examples  of  Bags,  when  AM  slightly  boosted  the  rating  of 
filling  in  examples  of  Bag-union,  and  this  task  Just  barely  squeaked  through  as  the  next  one 
to  be  chosen.  Note  that  the  rule  reinforced  that  task  twice,  since  both  domain  and  range  of 
Bag-union  are  bags. 


Any-concept .  Check 

1 5.  When  checking  facet  F  of  concept  C,  (for  any  F  and  C,) 

Prune  sway  at  the  entries  there  until  the  facet’s  size  is  reduced  to  the  size  which  C  merits. 

The  algorithm  for  doing  this  is  as  follows:  The  Worth  of  C  is  multiplied  by  the  overall 
worth  of  facet  type  F.  This  is  normalized  in  two  ways,  yielding  the  maximum  amount  of  list 
cells  that  C.F  may  occupy,  and  also  yielding  the  maximum  number  of  separate  entries  to 
keep  around  on  C.F.  If  either  limit  is  being  exceeded,  then  an  entry  is  plucked  at  random 
(but  weighted  to  favor  selection  from  the  rear  of  the  facet)  and  excised.  This  repeats  as  long 
as  C.F  is  oversized.  As  space  grows  tight,  the  normalization  weif’^'s  decline,  so  each 
concept’s  allocation  is  reduced. 

I€.  When  checking  facet  F  of  concept  C, 

Eliminate  redundant  entries. 

Although  it  might  conceivably  mean  something  for  an  entry  to  occur  twice,  this  was  never 
desirable  for  the  set  of  facets  which  each  AM  concept  possesMd. 


Any-concept .  Interest 

The  interest  features  apply  to  tell  hew  interesting  a  concept  is,  and  are  rarely  subdivided  by 
relevant  facet.  That  is,  most  of  the  reasons  that  Any  concept  might  be  interesting  will  be 
given  below. 

17.  A  concept  X  is  inioroiiing  if  X.(!onjoca  contains  aemo  intereating  entries. 

It.  A  concept  Is  Interesting  if  its  boundary  eecldentslly  coincides  with  another,  well-known, 
Interesting  concept. 
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The  boundary  of  a  concept  means  the  items  which  just  barely  fall  into  (or  just  barely  miss 
satisfying)  the  definition  of  that  concept.  Thus  the  boundary  of  Primes  might  include  1,2, 3, 4. 
If  the  boundary  of  Even  numbers  includes  numbers  differing  by  at  most  1  from  an  even 
number,  then  clearly  their  boundary  is  all  numbers.  Thus  it  coincides  with  the  already- 
known  concept  Numbers,  and  this  makes  Even-nos  more  interesting.  This  expresses  the 
property  we  intuitively  understand  as:  no  number  is  very  far  from  an  even  number. 

19.  A  concept  is  inferestir.g  if  its  boundary  accidentally  coincides  with  the  boundary  of  another, 

very  different,  interesting  concept. 

Thus,  for  example.  Primes  and  Numbers  are  both  a  little  more  interesting  since  the  extreme 
cases  of  numbers  are  all  boundary  cases  of  primes.  Even  numbers  and  Odd  numbers  both 
have  the  same  boundary,  namely  Numbers.  This  is  a  tie  between  them,  and  slightly  raises 
AM’s  interest  in  both  concepts. 

20.  A  concept  is  intorasting  if  it  is  —  accidentally  —  precisely  the  boundary  of  some  other, 

interesting  concept. 

In  the  case  mentioned  for  the  above  rule.  Numbers  is  raised  in  interest  because  it  turns  out 
to  be  the  boundary  for  even  and  odd  numbers. 

21.  A  concept  is  boring  if,  after  several  attempts,  only  a  couple  examples  are  found. 

Another  rule  indicates,  in  such  situations,  that  the  concept  may  be  forgotten  and  replaced  by 
some  conjecture. 

22.  Concept  C  is  interesting  if  some  normaliyinefficient  operation  F  can  be  efficiently  performed 

on  C't. 

Thus  it  is  very  fast  to  perform  Insert  of  items  into  lists  because  (i)  no  pre-existence  checking 
need  be  done  (as  with  sets  and  oseti),  and  (ii)  no  ordered  merging  need  be  done  (as  with 
bags).  So  "Lists"  is  an  interesting  concept  for  that  reason,  according  to  the  above  rule. 

23.  Concept  C  is  interesting  if  each  example  of  C  accidentally  seems  to  satisfy  the  otherwise* 

rarely  satisfied  predicate  P,  or  (equivalently)  if  there  is  an  unusual  conjecture  involving 

Cl 

This  is  almost  a  primitive  affirmation  of  intererestingness. 

24;  Concept  C  is  interesting  if  C  is  closely  related  to  the  very  interesting  concept  X. 

This  is  intererestingness  by  association.  AM  was  interested  in  Divisors-of  because  it  was 
closely  related  to  TIMES,  which  had  proven  to  be  a  very  interesting  concept. 

25.  Concept  C  is  interesting  if  there  is  an  analogy  in  which  C  corresponds  to  Y,  «nd  the  analogs  of 
the  Interest  features  of  Y  indicate  that  C  is  interesting. 

This  might  have  been  a  very  useful  rule,  if  only  there  had  been  more  decent  analogies 
floating  around  the  system.  As  it  was,  the  rule  was  rarely  used  to  advantage.  It  essentially 
says  that  the  analogs  of  Interest  criteria  are  themselves  (probably)  valid  criteria. 


%r  v'N-  xru  ?%.  .  -Vi 
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26.  A  conetpt  C  it  int«r«tting  if  on«  of  iit  gonorilizttiont  or  tpocitlizafiont  turnt  out  to  bo 

unoxpoctodly  vory  intoroiting. 

"Unexpected"  meant  that  the  Interetting  property  hadn’t  already  been  observed  for  C.  If  C 
is  interesting  in  some  way,  and  then  one  of  its  generalizations  is  seen  to  be  interesting  in 
exactly  the  same  way,  then  that  is  "expected".  It’s  almost  more  interesting  if  the  second 
concept  unexpectedly  lacks  tome  fundamental  property  about  C.  At  least  in  that  case  AM 
might  learn  something  about  what  gives  C  that  property.  In  fact,  AM  has  this  rule: 

27.  If  concept  C  poiioisoc  tome  vory  intorecting  property  lacked  by  one  of  ita  apecializationa  S, 
Then  both  C  and  S  become  slightly  more  interesting. 

In  the  LISP  program,  ths  is  closely  linked  with  rule  104. 

28.  If  a  concept  C  it  re*derived  in  a  new  way,  that  makes  it  more  interetting. 

if  concepts  Cl  and  C2  turn  out  to  be  equivalent  concepts,  then  merge  them.  The  combined 
concept  it  now  more  interetting  than  either  of  itt  predeeettort. 

The  two  conditionals  above  are  really  the  same  rule,  so  they  aren’t  given  separate  numbers. 
Cl  and  C2  might  be  conjectured  equivalent  because  their  examples  coincide,  each  is  a 
generalization  of  the  other,  their  definitions  can  be  formally  shown  to  be  equivalent,  etc. 
This  rule  is  similar  in  spirit  to  rule  number  1 14. 


The  following  heuristics  are  used  for  dealing  with  the  many  kinds  of  examples  facets  which 
a  concept  can  possess;  non-examples,  boundary  examples,  Isa  links,  etc. 


Any-concept .  Examples  .  Fillin 


29.  To  fill  in  oxamploi  of  X,  whore  X  it  a  kind  of  Y  (for  lomo  more  gonorti  concept  Y), 

Incpoct  tho  oxamploc  of  Y;  come  of  thorn  miy  bo  oxomplot  of  X  as  wall. 

Tho  furthor  romovod  Y  is  from  X,  tho  lost  cost'offoctivo  this  ruio  is. 

For  the  task  of  filling  in  Empty-structures,  AM  knows  that  concept  is  a  specialization  of 
Structures,  so  it  looks  over  all  the  then-known  examples  of  Structures.  Sure  enough,  a  few  of 
them  are  empty  (satisfy  £mpty-structures.Defn).  Similarly,  for  the  task  of  filling  in  examples 
of  Primes,  this  rule  would  have  AM  notice  that  Primes  is  a  kind  of  Number,  and  theref^ore 
look  over  all  the  known  examples  of  Number.  It  would  not  be  cost-effective  to  look  for 
primes  by  testing  each  example  of  Anything,  and  the  third  and  final  clause  in  the  above 
rule  recognizes  that  fact. 

30.  To  fill  in  non*tximplet  of  concept  X, 

Search  the  ipeciilizatione  of  X.  Look  at  ell  their  non>exampiei.  Some  of  them  may  turn  out  to 
be  non-oxamplei  of  X  at  well. 

This  rule  is  the  counterpart  of  the  last  one,  but  for  non-examples.  As  expected,  this  was  less 
useful  than  the  preceding  positive  rule. 
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31.  If  th«  current  task  is  to  fill  in  examples  of  any  concept  X, 

Then  one  way  to  get  them  is  to  symbolically  instantiate  a  definition  of  X. 

That  rule  simply  says  to  use  some  known  tricks,  some  hacks,  to  wring  examples  from  a 
declarative  definition.  One  trick  AM  knows  about  is  to  plug  already-known  examples  of  X 
into  the  recursive  step  of  a  definition.  Another  trick  is  simply  to  try  to  instantiate  the  base 
step  of  a  recursive  definition.  Another  trick  is  to  take  a  definition  of  the  form  "X  (x)  x  isa 
P,  and  <sub-expression>",  work  on  instantiating  just  the  sub-expression,  and  then  pop  back 
up  and  see  which  of  those  items  are  P’s. 

32.  If  ih«  current  task  is  to  fill  in  non*examples  of  concept  X, 

Then  one  fast  way  to  get  them  is  to  pick  any  random  item,  any  example  of  Anything,  and  check 
that  it  fails  X.Defn. 

This  is  an  affirmation  that  for  any  concept  X,  most  things  in  the  universe  will  probably  not 
be  X’s.  This  rule  was  almost  never  used  to  good  advantage:  non-examples  of  a  concept  X 
were  never  sought  unless  there  was  some  reason  to  expect  that  they  might  not  exist.  In  those 
cases,  the  presumption  of  the  above  rule  was  wrong,  and  it  failed.  That  is,  the  rule 
succeeded  iff  it  was  not  needed.^ 

33.  To  fill  in  oxamplas  of  concept  X, 

If  X.View  tolls  how  to  view  a  Z  as  if  it  were  an  X,  and  some  examples  of  Z  are  known, 

Then  just  run  X.View  on  those  examples,  and  check  that  the  results  really  are  X's. 

Thus  examples  of  osets  were  found  by  viewing  other  known  examples  of  structures  (e.g., 
examples  of  sets)  as  if  they  were  osets. 

34.  To  fill  in  examples  of  concept  X, 

Find  an  operation  whose  range  is  X,^  and  find  examples  of  that  operation  being  applied. 

To  fill  in  examples  of  Even-nos,  this  rule  might  have  AM  notice  the  operation  ‘Double’. 
Any  example  of  Double  will  contain  an  example  of  an  even  number  as  its  value:  e.g.,  <3-«6> 
contains  the  even  number  6. 

35.  If  the  current  task  is  to  fill  in  examples  of  concept  X, 

One  bizarre  way  is  to  specialize  X,  adding  a  strong  constraint  to  X.Defn,  and  then  look  for 
examples  of  that  new  specialization. 

Like  the  classical  "insane  heuristic"^  this  sounds  crazy  but  works  embarassingly  often.  If  I 
ask  you  to  find  numbers  having  a  prime  number  of  divisors,  the  rate  at  which  you  find 
them  will  probably  be  lower  than  if  I’d  asked  you  to  find  numbers  with  precisely  2  divisors. 
The  variety  of  examples  will  suffer,  of  course.  The  converse  of  this  heuristic  -  for  non- 
examples  -  was  deemed  too  unaesthetic  to  feed  to  AM. 

^  C»tch-22? 

3 

Qr  «t  IfMt  INTERSECTS  X  Utt  th«  fK«t«  *nd  th«  ripplinc  mcchtmtm  to  find  tuch  an  optration. 

A 

A  hsrdcr  taak  might  assiar  to  do.  A  atrongar  thaoram  might  ba  aaatar  to  prova.  Thia  ia  caiiad  "Tha  Invantor’a  Paradox'*, 
on  p(|t  t2t  of  [Poly*  57]. 
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36.  To  fill  in  txamplos  of  X, 

On*  in*ffici*nt  in*thod  it  to  •xamin*  random  txamplaa  of  Anyihinti  ehacMns  tach  by  running 
X.Dafn  to  a**  if  it  it  an  X.  Slightly  b*tt*r  it  to  rippi*  outward  from  X  in  all  dirtctiont, 
totting  til  th*  txamplat  of  th*  concoptt  Ancountarad. 

This  is  blind  generate-and-test,  and  was  (luckily)  not  needed  much  by  AM. 

37.  To  find  mor*  •xamplat  of  X  (ort  to  find  an  •xtr*m*  •xampla  of  X),  wh*n  a  nie*  big  •xampi*  it 

Known,  and  X  hat  a  raeuriiv*  definition, 

Try  to  plug  th*  Known  oxampi*  into  th*  definition  and  produce  a  timpler  one.  Repeat  thit  until 
an  example  it  produced  which  aatitfiet  th*  bace-ttep  predicate  of  th*  definition.  That 
entity  it  then  an  extreme  (boundary)  example  of  X. 

For  example,  AM  had  a  definition  of  a  set  as 

''Set($}  if  S-{}  or  if  Set(Remove-random-element(S)).''  When  AM  found  the  big  example 
{A, B,{{C},D}, {{{£}}}, F}  by  some  other  means,  it  used  the  above  rule  and  on  he  recursive 
definition  to  turn  this  into  {A,B,{{{£}}},F}  by  removing  the  randomly-chosen  third  element. 
{A,B,F}  was  produced  next,  followed  by  {B,F}  and  {F}.  After  that,  {}  was  produced  and  the 
rule  relinquished  control. 

38.  To  find  examples  of  X,  when  X  hat  a  recursive  definition. 

On*  method  with  low  tuceett  rat*  but  high  payoff  it  to  try  to  invert  that  definition,  thereby 
creating  a  procedure  for  generating  new  examples. 

Using  the  previous  example,  AM  was  able  to  turn  the  recursive  definition  of  a  set  into  the 
program  "Insert-any-random-item($)",  which  turns  any  set  into  a  (usually  different  and 
larger)  new  set.  Since  the  rules  which  AM  uses  to  do  these  transformations  are  very  special- 
purpose,  they  are  not  worth  detailing  here.  This  is  one  very  managable  open  problem, 
where  someone  might  spend  some  months  and  create  a  decent  body  of  definition-inversion 
rules.  A  typical  rule  AM  has  says: 

"Any  phrase  matching  'Removing  an  x  and  ensuring  that  PM'  can  be  inverted  and  turned 
into  this  one:  'Finding  any  random  x  for  which  PM  holds,  then  Inserting  x'."  The  class  of 
definitions  which  can  be  inverted  using  AM’s  existing  rules  is  quite  small;  whenever  AM 
needed  to  be  able  to  invert  another  particular  definition,  the  author  simply  supplied 
whatever  rules  would  be  required. 

39.  Whit*  filling  in  examples  of  C, 

if  two  constructs  x  and  y  are  found  which  are  very  similar  yet  only  on*  of  which  is  an  example 
of  th*  concept  C, 

Then  on*  is  a  boundary  example  of  C,  and  th*  other  is  a  boundary  non^example, 
and  it's  worth  cresting  mor*  boundary  examples  and  boundary  non-examples  by  slowly 
transforming  x  and  y  into  each  other. 

Thus  v/hen  AM  notices  that  {a}  and  {a,b,a}  are  similar  yet  not  both  sets,  it  creates  {a,b}, 
{b,a},  {a,a}  and  sees  which  are  and  are  not  examples  of  sets.  In  this  way,  some  boundary 
items  (both  examples  and  non-examples)  are  created.  The  rules  for  this  slow  transformation 
are  again  special  purpose.  They  examine  the  difference  between  the  items  x  and  y,  and 
suggest  operators  (e.g..  Deletion)  which  will  reduce  that  difi^erence.  This  GPS-like  strategy 
has  been  well  studied  by  others,  and  iu  inferior  implementation  inside  AM  will  not  be 
detailed. 
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40.  If  th«  main  task  now  is  to  fill  in  oxamplt!  of  concept  C, 

Consider  all  the  examples  of  "first  cou^'ins"  of  C.  Some  of  them  misht  be  examples  of  C  as 
well. 

By  "first  cousins",  wc  mean  all  direct  specializations  of  all  direct  generalizations  of  a  concept, 
or  vice  versa.  That  is,  going  up  once  along  a  Genl  link,  and  then  down  once  along  a  Spec 
link  (or  going  down  one  link  and  then  up  one  link). 

41.  If  the  main  task  now  is  to  fill  in  boundary  (non-}examples  of  concept  C, 

Consider  all  the  boundary  (non-)examples  of  "first  cousins"  of  C.  Some  of  them  mieht  lie  on 
the  boundary  of  C  as  well. 

If  they  turn  out  not  to  be  boundary  examples,  they  can  be  recorded  as  boundary  non¬ 
examples,  and  vice  versa. 

42.  To  fill  in  Isa  links  of  concept  X,  (that  is,  to  find  a  list  of  concepts  of  which  X  is  an  example), 

Just  ripple  down  the  tree  of  concepts,  applyine  a  definition  of  each  concept.  Whenever  a 

definition  fails,  don't  waste  time  trying  any  of  its  speeialiaations.  The  Isa's  of  X  are  then 
all  the  concepts  triad  whose  definitions  passed  X. 

When  a  new  concept  is  created,  e.g.,  a  new  composition,  this  rule  can  ascertain  the  most 
specific  Isa  links  that  can  be  attached  to  it.  Another  use  for  this  rule  would  be:  If  the  Isa 
link  network  ever  got  fouled  up  (contained  paradoxes),  this  rule  could  be  used  to  straighten 
everything  out  (with  a  logarithmic  expenditure  of  time).' 


Any-concept .  Examples  .  Suggest 

43.  If  some  (but  not  most)  examples  of  X  are  also  examples  of  i  (for  tome  concept  Y), 
end  some  (but  not  most)  examples  of  Y  are  also  examples  of  X, 

Create  a  new  concept  defined  as  the  intersection  of  those  two  concepts  (X  and  Y).  This  will  be 
a  specialization  of  both  concepts. 

If  you  hapen  to  notice  that  some  primes  are  palindromic,  this  rule  would  suggest  creating  a 
brand  new  concept,  defined  as  the  set  of  numbers  which  are  both  palindromic  and  prime. 
AM  never  actually  noticed  this,  since  it  represented  all  numbers  in  unary.  If  pushed,  AM 
will  define  Palindrome(n)  to  mean  that  the  sequence  of  exponents  of  prime  factors  is 
symmetric:  thus  2^3*5'?' 1 1*13^  is  palindromic  in  AM’s  sense  because  the  sequence  of  its 
exponents  (3  8  1  1  8  3)  is  unchanged  upon  reversal.  In  this  sense,  the  only  Prime 
palindromes  are  the  primes  themselves  (or:  just  *2’,  depending  upon  the  precise  definition). 

44.  If  vary  few  examples  of  X  are  found. 

Then  add  the  following  task  to  the  agenda!  "Generalize  the  concept  X",  for  the  following 
reason:  "X't  are  quite  rare}  a  slightly  less  restrictive  concept  might  be  more 
interesting". 

Of  course,  AM  contains  a  precise  meaning  for  the  phrase  "very  few".  When  AM  looks  for 
primes  among  examples  of  already-known  kinds  of  numbers,  it  will  find  dozens  of  non- 
examples  for  every  example  of  a  prime  it  uncovers.  "Very  few"  is  thus  naturally 
implemented  as  a  statistical  confidence  level.  AM  uses  this  rule  when  very  few  examples  of 
Equality  are  found  readily. 
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45.  If  vary  many  txampl«$  of  X  are  found  in  a  short  poriod  of  timo, 

Than  try  to  crtato  •  now,  tpocializod  vortion  of  X. 

This  is  similar  to  the  preceding  rule.  Since  numbers  are  easy  to  find,  this  might  cause  us  to 
look  for  certain  more  interesting  subclasses  of  numbers  to  study. 

46.  If  thoro  art  no  Known  oxamplos  for  tha  intorosting  concept  X, 

Than  considar  tpanding  soma  time  looking  for  such  axamplas. 

I’ve  heard  of  a  math  student  who  defined  a  set  of  number  which  had  quite  marvelous 

properties.  After  the  20^^  incredible  theorem  about  them  he’d  proved,  someone  noticed  that 
the  set  was  empty.  The  danger  of  unwittingly  dealing  with  a  vacuous  concept  is  even  worse 
for  a  machine  than  for  a  human  mathematician.  The  above  rule  explicitly  prevents  that. 

47.  If  tha  totality  of  axamplas  of  concept  C  is  too  small  to  ba  intarasting, 

Than  considar  thasa  raactions:  (i)  ganaraliaa  C}  (ii)  forgot  C  complataly;  (iii)  raplaca  C  by  ona 
conjactura. 

This  is  a  good  example  of  when  a  task  like  "Fill  in  ganaralizations  of  Numbars-wiih>l -divisors" 
might  get  proposed  with  a  high-priority  reason.  The  class  of  entities  which  C  encompasses 
is  simply  too  small,  too  trivial  to  be  worth  maintaining  a  separate  concept.  When  C  is 
numbers-with-l-divisor,  C  is  really  just  another  disguise  for  the  singleton  set  {1}.  The  above 
rule  might  cause  a  new  task  to  be  added  to  the  agenda,  Fill  in  ganaralizations  of  Numbars- 
with-1 -divisor.  When  that  task  is  executed,  AM  might  create  the  concept  Numbers-with- 
odd-no-of-divisors,  Numbers-with-prime-number-of-divisors,  etc.  Besides  generalizing  that 
concept,  the  above  rule  gives  AM  two  other  alternatives.  AM  may  simply  obliterate  the 
nearly-vacuous  concept,  perhaps  leaving  around  Just  the  statement "/  is  the  only  number  with 
one  divisor".  That  conjecture  might  be  tacked  onto  the  Conjees  facet  of  Divisors-of.  The 
actual  rule  will  specify  criteria  for  deciding  which  of  the  three  alternatives  to  try.  In  fact, 
AM  really  starts  all  three  activities:  a  task  will  always  be  created  and  added  to  the  agenda 
(to  generalize  C),  the  vacuous  concept  will  be  tagged  as  "forgettable",  and  AM  will  attempt 
to  formulate  a  conjecture  (the  only  items  satisfying  C.Defn  are  C.Exs). 

48.  If  fho  totslily  of  oxamplss  of  concept  C  is  too  large  to  be  interesting. 

Then  consider  theie  three  possible  reactions:  (i)  specialize  C;  (ii)  forget  C  completely;  (iii) 
replace  C  by  one  conjecture. 

This  is  analogous  to  the  preceding  rule,  but  is  used  far  less  frequently.  One  common  use  is 
when  a  disjunction  of  two  concepts  has  been  formed  which  is  accidentally  large  or  already- 
known  (e.g.,  "Evens  u  Odds"  would  be  replaced  by  a  conjecture). 

49.  After  filling  in  examples  of  C,  if  some  examples  were  found, 

Look  at  all  the  operations  which  can  be  applied  to  C's  (that  is,  access  C.ln-dom-of),  find  those 
which  are  interesting  but  which  have  no  known  examples,  and  suggest  that  AM  fill  in 
examples  for  them,  because  some  items  are  now  known  which  are  in  their  domain, 
namely  C.Exs. 

This  rule  had  AM  fill  in  examples  of  Set-insertion,  as  soon  as  some  examples  of  Sets  had 
been  found. 

50.  After  filling  in  examples  of  C,  if  some  examples  were  found, 

Consider  the  task  of  Cheeking  the  examples  facet  of  concept  C. 
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This  was  very  frequently  used  during  AM’s  runs. 

51.  Aft«r  checking  examples  of  C,  if  many  examples  remain, 

Consider  the  task  of  'Filling  in  some  Conjees  for  C. 

This  was  used  often  by  AM.  After  checking  the  examples  of  C,  AM  would  try  to 
empirically  formulate  some  interesting  conjecture  about  C. 

52.  After  successfully  filling  in  non*exampies  of  X,  if  no  examples  exist, 

If  AM  has  not  recently  tried  to  find  examples  of  X,  then  it  should  do  so. 
if  AM  has  recently  tried  and  failed  to  find  examples,  consider  the  conjecture  that  X  is  vacuous, 
empty,  null,  sIways-False.  Consider  generalizing  X. 


53.  After  trying  in  vain  to  find  some  non-examples  of  X,  if  many  examples  exist. 
Consider  the  conjecture  that  X  is  universal,  always-True.  Consider  specializing  X. 


54.  After  successfully  filling  in  examples  of  X,  if  no  non-examples  exist. 

If  AM  has  not  recently  tried  to  find  non-examples  of  X,  then  it  should  consider  doing  so. 

If  AM  has  recently  tried  and  failed  to  find  non-examples,  consider  the  conjecture  that  X  is 
universal,  always-True.  Consider  specializing  X. 


55.  After  trying  in  vain  to  find  some  examples  of  X, 

If  many  non-examples  exist. 

Consider  the  conjecture  that  X  is  vacuous,  null,  empty,  always-False.  Consider  generalizing  X. 


Any-concept  .  Examples  .  Check 


56.  If  the  current  task  is  to  Check  Examples  of  concept  X, 

and  (Forsome  Y)  Y  is  a  generalization  of  X  with  many  examples, 
and  all  examples  of  Y  (ignoring  boundary  cases)  are  also  examples  of  X, 

Then  conjecture  that  X  is  really  no  more  specialized  than  Y, 
and  Check  the  truth  of  this  conjecture  on  boundary  examples  of  Y, 

and  see  whether  Y  might  itself  turn  out  to  be  no  more  specialized  than  one  of  its 
generalizations. 

This  rule  caused  AM,  while  checking  examples  of  odd-primes,  to  conjecture  that  all  primes 
were  odd-primes. 

57.  if  the  current  task  is  to  Check  Examples  of  concept  X, 
and  (Forsome  Y)  Y  is  a  specialization  of  X, 

and  all  examples  of  X  (ignoring  boundary  cases)  are  also  examples  of  Y, 

Then  conjecture  that  X  is  really  no  more  general  than  Y, 

and  Check  the  truth  of  this  conjecture  on  boundary  examples  of  X, 

and  see  whether  Y  might  itself  turn  out  to  be  no  more  general  than  one  of  its  specializations. 

1  ms  rule  is  analogous  to  the  preceding  one  for  generalizations. 
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58.  When  checking  boundary  examples  of  a  concept  C, 
ensure  that  every  scrap  of  C.Defn  has  been  used. 

It  IS  often  the  tiny  details  in  the  definition  that  determine  the  precise  boundary.  Thus  we 
must  look  caiefully  to  see  whether  Primes  allows  1  as  an  example  or  not.  A  definition  like 
"numbers  divisible  only  by  1  and  themselves"  includes  1,  but  this  definition  doesn’t: 
"numbers  having  precisely  2  divisors".  In  the  LISP  program,  this  rule  contains  several 
hacks  (tricks)  for  checking  that  the  definition  has  been  stretched  to  the  fullest.  For  example; 
II  the  definition  is  of  the  form  "all  x  in  X  such  that...",  then  pay  careful  attention  to  the 
boundary  of  X.  That  is,  take  the  time  to  access  X.Boundary-exs  and  X.Boundary-non-exs, 
and  check  them  against  C.Defn. 

59.  Wh«n  checking  examples  of  C, 

Ensure  that  each  example  satisfiec  C.Defn,  and  each  non-example  fails  it.  The  precise  member 
of  C.Defn  to  use  can  be  chosen  depending  on  the  example. 

As  described  earlier  in  the  text,  definitions  can  have  descriptors  which  indicate  what  kinds 
of  arguments  they  might  be  be.st  for,  their  overall  speed,  etc. 

60.  When  checking  examples  of  C, 

If  an  entry  e  is  rejected  (i.e.,  it  is  seen  to  be  not  an  example  of  C  after  all),  then  remove  e 
from  C.Exs  and  consider  inserting  it  on  the  Boundary  non-i.>xamples  facet  of  C. 

There  is  a  complicated®  algorithm  for  deciding  whether  to  forj  '  e  entirely  or  to  keep  it 
around  as  a  close  but  not  close  enough  kind  of  example. 

61.  When  checking  examples  of  C, 

After  an  entry  e  has  been  verified  as  a  bone  fide  example  of  C, 

Check  whether  e  is  also  a  valid  example  of  some  direci  specialization  of  C. 

If  it  is,  then  remove  It  from  C.Exs,  and  consider  adding  it  to  the  examples  facet  of  that 
specialization,  and  luggeit  the  task  of  Checking  examples  of  that  specialization. 

62.  When  checking  examples  of  C, 

If  an  entry  e  is  rejected. 

Then  check  whether  e  is  nevertheless  a  valid  example  of  some  generalization  of  C. 

If  it  is,  consider  adding  it  to  that  concept's  boundary-examples  facet,  and  consider  adding  it  to 
the  boundary  non-examples  facet  of  C. 

This  is  similar  to  the  preceding  rule. 

63.  When  checking  non-examples  of  C,  including  boundary  non-examples. 

Ensure  that  each  one  fails  a  definition  of  C.  Otherwise,  transfer  it  to  the  boundary  examples 
facet  of  C. 


Nol  rir.:«t$«rily  tophiilicfltd  First,  AM  scettMt  th«  Worth  of  C  From  this  it  dttermms*  how  many  boundary  non- 
•namplot  C  dtforvas  to  kaop  around  (and  how  many  total  hat  calls  it  merits).  AM  compares  these  quotas 
with  the  current  number  of  (and  size  of)  entries  already  listed  on  Cbdy-non-exs.  The  degree  of  need  of 
another  entry  there  then  eets  the  ’odds*  for  insertion  versus  forgetting.  Finally  a  random  number  is 
computed,  end  the  odds  determine  what  range  it  must  lie  in  for  e  to  be  remembered. 
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64.  Wh«n  checking  novexamples  of  C,  including  boundir/  non-examples, 

After  an  entry  *  has  been  verified  is  ■  bone  fide  non-example  of  C, 

Check  whethctr  e  is  also  a  non-example  of  some  direct  generalization  of  C. 

If  it  is,  then  remove  it  from  C.Non-Exs,  and  consider  adding  it  to  the  non-examples  facet  of 
that  generalization,  and  suggest  the ‘task  of  Checking  examples  of  that  generalization. 


65.  When  checking  (boundary)  non-examples  of  C, 

If  an  entry  e  is  rejected,  that  is  if  it  turns  out  to  be  an  example  of  C  after  all. 
Then  check  whether  e  is  nevertheless  a  non-example  of  some  specialization  of  C. 
If  ft  is,  consider  adding  it  to  that  concept’s  boundary  non-examples  facet. 

This  is  similar  to  the  preceding  rule. 


Any-concept .  Conjees  .  Fillin 

When  the  task  is  to  look  around  and  find  conjectures  dealing  with  concept  C,  the  following 
genera)  rules  may  be  useful. 

66.  if  there  is  an  analogy  from  X  to  C,  and  a  nice  item  in  X.Conjecs,  formulate  anti  test  the 

analogous  conjecture  for  C. 

Since  an  analogy  is  not  much  more  than  a  set  of  substitutions,  formulating  the  'analogous 
conjecture’  is  almost  a  purely  syntactic  transformation. 

67.  Examine  C.Exs  for  regularities. 

What  mysteries  are  lurking  in  the  LISP  code  for  thli  rule,  you  ask?  Nothing  but  a  few 
special-purpose  hacks  and  a  few  uitra-general  hacks.  Here  is  a  slightly  more  specific  rule  for 
you  seekers: 

68.  Look  at  C.Exs.  Pick  one  element  at  random.  Write  down  statements  true  about  that  example  e. 

Include  a  list  of  all  concepts  of  which  it  is  an  example,  all  Interests  features  it  satisfies, 
etc. 

Then  check  each  conjecture  on  this  list  against  all  other  known  examples  of  C.  If  any  example 
(except  a  boundary  example)  of  C  violates  a  conjecture,  discard  it. 

Take  all  the  surviving  conjectures,  snd  eliminate  any  which  trivally  follow  from  other  ones. 

This  is  a  common  way  AM  us''s;  induce  a  conjecture  from  one  example  and  test  it  on  all 
the  rest.  A  more  sophisticated  approach  might  be  to  induce  it  by  using  a  few  examples 
simultaneously,  but  I  haven’t  thought  of  any  nontrivial  way  to  do  that.  The  careful  reader 
will  perceive  that  most  of  the  conjectures  AM  will  derive  using  this  heuristic  will  be  of  the 
form  "X  is  unexpectedly  a  specialization  of  Y",  or  "X  is  unexpectedly  an  example  of  Y",  etc. 
Indeed,  most  of  AM’s  conjectures  are  really  that  simple  syntactically. 

69.  Kormulst*  a  psrameisrizsd  conjseturs,  a  "tamplats",  which  gats  slowly  spacializad  or 

instantiaiad  into  a  dafinita  coiijactura. 
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AM  has  only  a  few  trivial  methods  for  doing  this  (e.g.,  introduce  a  variable  initially  and 
find  the  constant  value  to  plug  in  there  later).  As  usual,  they  will  be  omitted  here,  and  the 
author  encourages  some  research  in  this  area,  to  turn  out  a  decent  set  of  general  rules  for 
accomplishing  this  hypothesis  template  instantiation.  The  best  effort  to  date  along  these 
lines,  in  one  specific  sophisticated  scientific  field,  is  that  of  META-DENDRAL  [Buchanan]. 


Any-concept .  Conjees  .  Check 


70.  If  a  univarsal  conjaciura  (For  all  X't,  ...)  is  contradictad  by  ampirical  data,  gather  tha  data 

togathar  and  try  to  find  a  ragularity  in  thosa  axcaptions. 

If  this  succaads,  giva  tha  axcaptions  a  name  N  (if  they  aren’t  already  a  concept),  and  rephrase 
tha  conjecture  (For  all  X's  which  are  not  N's...).  Consider  making  X-N  a  new  concept. 

I 

.Again  note  how  "active"  this  little  checking  rule  can  be.  It  can  patch  up  nearly-true 
conjectures,  examine  data,  define  new  concepts,  etc. 

7 1 .  After  verifying  a  conjecture  for  concept  C, 

See  if  it  also  holds  for  related  concepts  (e.g.,  a  generalization  of  C). 


There  are  of  course  bookeeping  details  not  explicitly  shown  above,  which  are  present  in  the 
LISP  program.  For  example,  if  conjecture  X  is  true  for  all  specializations  of  C,  then  it  must 
be  added  to  C.Conjecs  and  removed  from  the  Conjees  facets  of  each  specialization  of  C. 


Any-concept .  Conjees  .  Suggest 

72.  If  X  is  probably  ralatad  to  Y,  but  no  definite  connection  is  known. 

It’s  worthwhile  looking  for  a  specific  conjecture  tying  X  and  Y  together. 

How  might  AM  know  that  X  and  Y  are  only  probably  related?  X  and  Y  may  play  the  same 
role  in  an  analogy  (e.g.,  the  singleton  bag  "(T)"  and  "any  typical  singleton  bag"  share  many 
properties),  or  they  may  both  be  specializations  of  the  same  concept  Z  (e.g.,  two  kinds  of 
numbers),  or  they  may  both  have  been  created  in  the  same  unusual  way  (e.g.,  Pius  and 
Times  and  Exponentiation  are  all  creatable  by  repeating  another  operation). 


Any-concept .  Conjees  .  Interest 

73.  A  conjociuro  sbout  X  is  inUrosting  if  X  is  vory  inUrosting. 


74.  A  nonconstructiv*  oxistenco  conjoctur*  is  interesting. 

Thus  the  unique  factorization  theorem  is  judged  to  be  interesting  because  it  merely 
guarantees  that  some  factoring  will  be  into  primes.  If  you  give  an  algorithm  for  that 
factoring,  then  the  theorem  actually  loses  its  mystique  and  (according  to  this  rule)  some  of 
its  value.  But  it  increases  in  value  due  to  the  next  rule. 


A 
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75.  A  constructive  existence  conjecture  it  interesting  if  it  is  frequently  used. 


76.  A  conjecture  C  about  X  is  interesting  if  the  origin  and  the  verification  of  C  for  each 
specialization  of  X  was  quite  independent  of  each  other,  and  preceded  C’s  being  noticed 
applicable  to  all  X's. 

This  would  be  even  more  striking  if  proof  techniques  were  known,  and  each  specialized  case 
had  a  separate  kind  of  proof.  Many  number  theory  results  are  like  this,  where  there  exists 
a  general  proof  only  for  numbers  bigger  than  317,  say,  and  all  smaller  numbers  are  simply 
checked  individually  to  make  sure  they  satisfy  the  conjecture.  Category  theory  is  built  upon 
practically  nothing  but  this  heuristic. 


Appendix  3.2.4.  Heuristics  for  the  Analogies  facet  of  Anv-concept 


Any-concept .  Analogies .  Fillin 

77.  To  fill  in  conjectures  involving  concept  C,  where  C  is  analogous  to  D, 
Consider  the  analogue  of  each  conjecture  involving  D. 


78.  if  the  current  task  involves  a  specific  analogy,  and  the  request  is  to  find  more  conjectures. 

Then  consider  the  analog  of  each  interesting  conjecture  about  any  concept  involved  centrally 

in  the  analogy. 

That  is,  this  rule  suggests  applying  the  preceding  rule  to  each  concept  which  is  central  to 
the  given  analogy.  The  result  is  a  flood  of  new  conjectures.  There  is  a  tradeoff  (explicitly 
taken  into  account  in  the  LISP  version  of  this  rule)  between  how  interesting  a  conjecture 
has  to  be,  and  how  centrally  a  concept  has  to  fit  into  the  analogy,  in  order  to  determine 
what  resources  AM  should  be  willing  to  expend  to  find  the  analogous  conjecture.  Note  that 
this  is  not  a  general  suggestion  of  what  to  do,  but  a  specific  strategy  for  enlarging  the 
analogy  itself.  If  the  new  conjecture  is  verified,  then  not  only  would  it  be  entered  under 
some  Conjees  facet,  but  it  would  also  go  to  enlarging  the  data  structure  which  represents  the 
analogy. 

79.  Let  tha  analogy  suggest  how  to  specialize  and  generalize  each  concept  into  what  is  at  least 

the  analog  of  a  Known,  very  interesting  concept. 


Like  the  last  rule,  this  one  simply  says  to  use  the  analogy  itself  as  the  "reason"  for  exploring 
certain  new  entities,  in  this  case  new  concepts.  When  the  Bags«Numbers  analogy  is  made, 
AM  notices  that  Singleton  bags  and  Empty  bags  arc  two  interesting,  extreme  specializations 
of  Bags.  The  above  rule  then  allows  AM  to  construct  and  study  what  we  know  and  love  as 
the  numbers  one  and  zero.  The  analogy  is  flawed  because  there  is  only  one  "one",  although 
there  are  many  different  singleton  bags.  But  just  as  singletons  and  empty  bags  have  special 
properties  under  bag  opei.itions,  so  do  0,1  under  numeric  operations.  This  was  one  case 
where  an  analogy  paid  off  handsomely.  ' 
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80.  If  it  is  desired  to  have  an  analogy  between  concepts  X  and  Y,  then  look  for  two  already-known 
analogies  between  X«*Z  and  Z«Y,  for  any  Z. 

If  found,  compose  the  two  analogies  and  see  it  the  resultant  analogy  makes  sense. 


Since  the  analogies  are  really  just  substitutions,  composing  them  has  a  familiar,  precise 
meaning.  This  rule  was  never  really  used  by  AM,  due  to  the  paucity  of  analogies.  The  user 
can  push  AM  into  creating  more  of  them,  and  ultimately  using  this  rule.  A  chain  from 
X«ZwY«X  can  be  found  which  presents  a  new,  bizarre  analogy  from  X  to  itself. 


Any-concept  .  Analogies  .  Suggest 


81.  If  an  analogy  is  strong,  and  on«  concapt  has  a  v«ry  intarasting  universal  conjactura  C  (For  all 

X  in  B...),  but  the  analog  conjactura  C  is  falsa. 

Than  it's  worth  constructing  the  sat  of  itams  in  B'  for  which  tha  conjactura  holds.  It’s  parhaps 
avan  mora  intarasting  to  isolate  tha  sat  of  axcaptional  alamants. 

With  the  Add«Times  analogy,  it’s  true  that  all  numbers  n>l  can  be  represented  as  the  sum 
of  two  other  numbers  (each  of  them  smaller  than  n),  but  it  is  not  true  that  all  numbers  (with 
Just  a  couple  exceptions)  can  be  represented  as  the  product  of  other  (hence  smaller) 
numbers.  The  above  rule  has  AM  define  the  set  of  numbers  which  can/can’t  be  so 
represented.  These  are  just  the  composite  numbers  and  the  set  of  primes.  This  second  way 
of  encountering  primes  was  very  unexpected  -  both  by  AM  and  by  the  author.  It  expresses 
the  deep  fact  that  one  difference  between  Add  and  Times  is  the  presence  of  primes  only  for 
multiplication.  At  the  time  of  its  discovery,  AM  didn’t  appreciate  this  fully  of  course, 

82.  If  space  is  tight,  and  no  use  of  the  analogy  has  ever  been  made,  and  it  is  very  old,  and  it  takes 

up  a  lot  of  space. 

Then  it  is  permissable  to  forget  it  without  a  trace. 


83.  If  two  valuable  conjectures  are  syntactically  identical,  and  can  be  made  identical  by  a  simple 

substitution,  then  tentat  vely  consider  the  analogy  which  is  that  very  substitution. 

Thus  the  associative/commutative  property  of  Add  and  Times  causes  them  to  be  tied 
together  in  an  analogy,  because  of  this  rule. 

84.  if  an  analogy  is  very  interesting  and  very  complete. 

Then  spend  some  time  refining  it,  looking  for  small  exceptions.  If  none  are  found,  see  whether 
the  two  situations  are  genuinely  isomorphic. 


85.  If  concepts  X  and  Y  are  analogous,  look  for  analogies  between  their  specializations,  or 
between  their  generalizations. 

This  i-ule  is  not  used  much  by  AM,  although  the  author  thought  it  would  be. 


^  ^  *-^  v"  V  ^  «r  Kr  h.^  5 
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86.  An  analogy  which  has  no  diserapancias  whaitoavar  is  not  as  intarasting  as  a  slightly  flawad 
analogy. 


87.  An  analogy  is  intarasting  if  it  associatas  (for  tha  first  tima)  two  concapts  which  ara  aach 
unusally  fully  fillsd  out  (having  many  conjacturas,  many  axamplas,  many  intarast 
faaturas,  ate.). 


Appendix  3.2.5.  Heuristics  for  the  Genl/Spec  facets  of  Anv-concapt 
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88.  To  fill  in  specializations  of  X,  if  it  was  vary  aasy  to  find  axamplas  of  X, 

Grab  soma  faaturas  which  would  indicata  than  an  X  was  intarasting  (soma  antrias  from 
X.Intarest,  or  more  ramota  Intarast  predicates  garnered  by  rippling),  and  conjoin  them 
onto  the  definition  of  X,  thereby  creating  a  new  concept. 

Here’s  one  instance  where  the  above  rule  was  used:  It  was  so  easy  for  AM  to  produce 
examples  of  sets  that  it  decided  to  specialize  that  concept.  The  above  rule  then  plucked  the 
interestingness  feature  "all  pairs  of  members  satisfy  the  same  rare  predicate"  and  conjoined 
It  to  the  old  definition  of  Set:.  The  new  concept,  Interesting-sets,  included  all  singletons 
(because  all  pairs  of  members  drawn  from  a  singleton  satisfy  the  predicate  Equal)  and 
empty  sets. 

89.  To  fill  in  generalizations  of  concept  X, 

Take  the  definition  e,  and  replace  it  by  a  generalization  of  e.  If  e  is  a  concept,  use  e.GenI;  if 
e  is  a  conjunction,  then  remove  a  conjunct  or  generalize^  a  conjunct;  if  e  is  a 
disjunction,  then  add  a  disjunct  or  generalize  a  disjunct;  if  e  ismegated,  then  specialize 
the  negate;  if  e  is  an  example  of  E,  then  replace  e  by  "any  example  of  E";  if  e  satisfies 
any  property  P,  then  replace  e  by  "anything  satisfying  P";  if  e  is  a  constant^,  then 
replace  e  by  a  new  variable  (or  an  existing  one)  which  could  assume  value  e;  if  e  is  a 
variable,  then  enlarge  its  uepe  of  possible  bindings. 

This  rule  contains  a  bag  of  tricks  for  generalizing  any  LISP  predicate,  the  definition  of  any 
concept.  They  are  all  syntactic  tricks,  however. 

90.  To  fill  in  generalizations  of  concept  X,  If  some  conjecture  exists  about  "all  X's  and  Y’s”  or  "in 

X  or  Y",  for  some  other  concept  Y, 

Create  a  new  concept,  a  generalization  of  both  X  and  Y,  defined  as  their  disjunction. 


e 

I*,  racur. 

^  Of  eourae  H’a  unliktiy  that  a  concapt  ia  dafkwd  aimpty  at  a  conatant,  but  tha  pracading  footnota  ahowa  that  thia  littia 
program  can  ba  antarad  racuraivaty,  baing  fad  a  aub-axpraaaion  of  tha  dafinition. 
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This  rule  contains  another  trick  for  generalizing  any  concept,  although  it  is  more 
meaningful,  more  semantic  than  the  previous  rule’s  tricks.  Many  theorems  are  true  about 
numbers  with  1  or  2  divisors,  so  this  might  be  one  reasonable  way  to  generalize  Numbers- 
with-l-di visor  into  a  new  useful®  concept. 

91.  To  fill  in  generalizations  of  concept  X, 

If  other  generalizations  Gl,  G2  of  X  exist  but  are  TOO  general, 

Create  a  new  concept,  a  generalization  of  X  and  a  specialization  of  both  Gl  and  G2,  defined  as 
the  conjunction  of  Gl  and  G2's  definitions. 

Thus  when  AM  generalizes  Reverse-all-levels  into  Reverse-top-level  and  Reverse-first- 
element,  the  above  tuie  causes  AM  to  create  a  new  operation,  which  reverses  the  top  level 
and  which  reverses  the  CAR®  of  the  original  list.  While  not  particularly  useful,  the  reader 
ihould  observe  that  it  is  in  fact  midway  in  generality  between  the  original  Reverse  i unction 
and  the  first  two  generalizations. 

32  To  fill  m  specializations  of  concept  X, 

Take  iite  definition  e,  and  replace  it  by  a  specialization  of  e.  If  e  is  a  concept,  use  e.GenI;  if  e 
is  a  disjunction,  then  remove  a  disjunct  or  specialize  a  disjunct;  if  e  is  a  conjunction, 
then  add  a  conjunct  or  specialize  a  conjunct;  if  e  is  negated,  then  generalize  the 
negate;  if  e  is  "any  example  of  E",  then  replace  e  by  a  particular  example  of  E;  if  e  is 
"anything  satisfying  P",  then  replace  e  by  a  particular  satisfier  of  P;  if  e  is  a  variable, 
then  replace  it  by  a  well-chosen  constant  or  restrict  its  scope. 

This  rule  contains  a  bag  of  tricks  for  specializing  any  LISP  predicate,  the  definition  of  any 
concept.  They  are  all  syntactic  tricks,  however.  Note  that  the  Lisp  code  for  this  rule  will 
typically  call  itself  (recur)  in  order  to  specialize  small  pieces  of  the  original  definition. 

93.  To  fill  in  specializations  of  concept  X,  if  some  conjecture  exists  about  "ali  X's  which  are  also 

Y’s"  or  "in  X  and  V",  for  some  other  concept  Y, 

Create  a  new  concept,  a  specialirviion  of  both  X  and  Y,  defined  as  their  conjunction. 

This  rule  contains  another  trick  for  specializing  any  concept,  although  it  is  more 
meaningful,  more  sentantic  than  the  previous  rule’s  tricks.  Many  theorems  about  primes 
contain  the  condition  "p>2";  i.e.,  they  are  really  true  about  primes  which  are  odd.  So  this 
might  be  one  reasonable  way  to  specialize  Primes  into  a  new  concept:  by  conjoining  the 
definitions  of  Primes  and  Odd-numbers,  into  the  new  concept  Odd-primes.  Here’s  anothei 
usage  of  this  rule:  If  AM  had  originally  defined  Primes  to  include  ‘1’,  then  the  frequency  of 
conjectures  where  1  was  an  exception  would  trigger  this  rule  to  define  Primes  more  normally 
(p>2). 

94.  To  fill  in  specializations  of  concept  X, 

If  other  specializations  $1,  $2  of  X  exist  but  are  TOO  restrictive  to  be  interesting. 

Create  a  new  concept,  a  specialization  of  X  and  a  generalization  of  both  SI  and  S2,  defined  as 
the  disjunction  of  Si  and  S2’s  definitions. 


«l  I***!,  Mvtril  lh«or«m*  will  bt  tUltd  mors  concittly  u«inf  this  n«w  conctpt:  Numbtrt  with  1  or  2  divitort. 
g 

*l<o  th«  CAR  of  th«  CAR,  otc ,  until  *  non-li«t  i*  oncounttrod. 
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95.  To  fill  in  gontralizations  of  concapt  X,  whan  a  racuraiva  dafinition  of  X  axiiti, 

If  fha  definition  contains  two  eonjoinad  racuraiva  calls,  replaea  them  by  a  disjunction  or 
eliminate  one  call  entirely. 

If  there  is  only  one  recursive  call,  disjoin  a  second  call,  this  one  on  a  different  destructive 
function  applied  to  the  original  argument.  If  the  original  destructive  function  is  one  of 
{CAR,CDR},  then  let  the  new  one  be  the  other  member  of  that  pair. 

AM  uses  the  first  part  of  this  rule  to  turn  Equal-lists  into  two  variants  of  Same-length-as. 
The  second  part,  while  surprisingly  unused,  could  work  on  this  definition  of  MEMBER:  "A 
(x,L)  LISTP(L)  and:  [x=CAR(L)  or  MEMBER(x,CDR{L»]",  which  is  just  "membership  at  the  top 
level  or,  or  (,  and  transform  it  into  this  one  of  MEM,  which  represents  membership  at  any 
depth:  "Mx.D  LISTP'°(L)  and:  [x-CAR(L)  or  MEM(x,CDR(L))  or  MEM{x,CAR(L))]".  The  rule 
noticed  a  recursive  call  on  CDR(L),  and  simply  disjoined  a  recursive  call  on  CAR(L). 

96.  To  fill  in  specializations  of  concept  X,'when  a  recursive  definition  of  C  exists. 

If  the  definition  contains  two  disjoined  recursive  calls,  replace  them  by  a  conjunction  or 
eliminate  one  call  entirely. 

If  there  is  only  one  recursive  call,  conjoin  a  second  on  another  destructive  function  applied  to 
the  original  argument.  Often  the  two  recursions  will  be  on  the  CAR  and  the  COR  of  the 
original  argument  to  the  predicate  which  is  the  definition  for  X. 

This  is  closely  related  to  the  preceding  rule.  Just  as  it  turned  the  concept  of  ‘element  of’  into 
the  more  general  one  of  'membership  at  any  depth’,  the  above  rule  can  specialize  the 
definition  of  MEMBER  into  this  one,  called  AMEM:  (x,L)  LISTP(L)  and:  [x-CARd)  or; 

[AMEM(x,CDR(L))  and  AMEM(x,CAR(L))]]".“ 

97.  To  fill  in  specializations  of  concept  X, 

Find,,  within  a  definition  of  X  (at  even  parity  of  NOT’s),  an  expression  of  the  form  "For  some  x 
in  X,  P(xr,  and  replace  it  either  by  "For  all  x  in  X,  P(x)",  or  by  Pfx^). 

Thus  "sets,  all  pairs  of  whose  members  satisfy  SOME  interesting  predicate"  gets  specialized 
into  "sets,  all  pairs  of  whose  members  satisfy  Equality".  The  same  rule,  with  "even  parity" 
replaced  by  "odd  parity",  is  useful  for  generalizing  a  definition.  This  rule  is  really  A 
separate  rules,  in  the  LISP  program.  The  same  rule,  with  the  transformations  going  in  the 
opposite  direction,  is  also  used  for  generalizing.  The  same  rule,  with  the  transformations 
reversed  and  the  parity  reversed,  is  used  for  specializing  a  definition.  Here  is  that  doubly- 
switched  rule: 

98.  To  fill  in  specializations  of  concept  X, 

Find  within  a  definition  of  X  (at  odd  parity  of  NOT’s)  an  expression  of  the  form  "For  all  x  in  X, 
P(x)",  and  replace  it  either  by  "For  some  x  in  X,  P(x)",  or  by  Pfx^j).  Or  replace  "P(o4)", 

where  ec  is  a  constant,  by  "For  some  x  in  A,  P(x)”  where  A  is  a  concept  of  which  oc  is 
ont  .example. 


The  Interhep  function  LISTP(L)  te«t«  whether  or  not  I  le  t  (nonnull)  lot. 

*  *  This  operation  ie  almoat  impottible  to  explain  verbally.  AMEM(x,l)  maan*  that  x  it  an  alemant  of  L,  and  for  each  member 
M  of  L  before  the  x,  M  it  an  ordered  atructura  and  x  ia  an  alamant  of  M,  and  for  each  membar  N  of  M 
before  the  x  which  ia  inaida  M^.  etc.  E.f ,  <(x]  (  <<x  a  b>  <x>  x  d  e>  <x  f>  x  |  h  ]  [<x  i>  x  j]  x  k  [I]  m>. 


Appendix  3 


AM  Oieeovtry  in  Mithemttice  •«  Heurietic  Search 


-247- 


99.  Whttn  creating  in  a  specialization  $  of  concept  C, 

Note  that  S.Geni  should  contain  C,  and  that  C.Spec  should  contain  $. 

The  analogous  rule  exists,  in  which  all  spec  and  geni  are  switched. 


Any-concept .  Genl/Spec  .  Suggest 

100.  After  creating  a  new  specialization  S  of  concept  C, 

Explicitly  look  for  ties  between  S  and  other  known  specializations  of  C. 

For  example,  after  AM  defines  the  new  concept  of  Numbers-with-3-divisors,  it  looks  around 
for  ties  between  that  kind  of  number  and  other  kinds  of  numbers. 

101.  After  creating  a  new  generalization  G  of  concept  C, 

Explicitly  look  for  ties  between  G  and  other  close  generalizations  of  C. 

For  example,  AM  defined  the  new  predicates  Same-size-CARs  and  Same-size-CDRs*^  as 
two  generalizations  of  Equality.  The  above  rule  then  suggested  that  AM  explicitly  try  to 
find  some  connection  between  these  two  new  predicates.  Although  AM  (ailed,  Don  Knuth 
(using  a  similar  heuristic,  perhaps)  also  looked  for  a  connection,  and  found  one:  it  turns  out 
that  the  two  predicates  are  both  ways  of  defining  the  relation  we  intuitively  understand  as 
"having  the  same  length  as". 

102.  After  ertating  a  new  ipacialization  S  of  concept  C, 

Consider  looking  for  examples  of  S. 

This  has  to  be  said  explicitly,  because  all  too  often  a  concept  is  specialized  into  vacuousness. 

1 03.  After  creating  a  new  generalization  G  of  concept  C, 

Consider  looking  for  non-examples  of  G. 

This  has  to  be  said  explicitly,  because  all  too  often  a  concept  is  generalized  into  vacuous 
universality.  This  rule  is  less  useful  to  AM  than  the  preceding  one. 

104.  if  concept  C  possesses  some  very  interesting  property  lacked  by  one  of  its  specializations  S, 
Then  considering  creating  a  concept  intermediate  in  specialization  between  C  and  S,  and  see 

whether  that  possesses  the  property. 

This  rule  will  trigger  whenever  a  new  generalization  or  specialization  is  created. 

105.  If  concept  S  is  now  very  interesting,  and  $  was  created  as  a  specialization  of  some  earlier 

concept  C, 

Give  extra  consideration  to  specializing  $,  and  to  specializing  concept  C  again  (but  in 
different  ways  than  ever  before). 


1 2 

Two  litt*  (tliofy  Stmt-tixo-COR*  iff  thoy  htvt  th«  •tmo  numbtr  of  mombtrt.  Two  li*l«  ttlisfy  S«m*-«ixo-CAR*  iff 
(whtft  wrHton  out  in  itandird  .LISP  notation)  thay  hava  the  aama  number  of  initial  left  parentheaea  and  alao 
have  the  aame  firat  identifier  fellewin|  that  laat  initial  left  parentheaia. 
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The  next  rule  is  the  analog  of  the  preceding  one.  They  incorporate  tiny  bits  of  the 
strategies  of  hill-climbing  and  learning  from  one’s  successes. 

106.  If  concept  G  is  now  very  interesting,  and  G  was  created  as  a  generaiization  of  some  earlier 

concept  C, 

Give  extra  consideration  to  generalizing  G,  and  to  generalizing  C  in  other  ways. 

The  analogous  rules  exist,  for  concepts  that  have  become  so  boring  they’ve  just  been 
discarded: 

107.  If  concept  X  proved  to  be  a  dead-end,  and  X  was  created  as  a  generalization  of 

(specialization  of)  some  earlier  concept  C, 

Give  less  consideration  to  generalizing  (specializing)  X,  and  to  generalizing  (specializing)  C  in 
other  ways  in  the  future. 


Any-concept .  Genl/Spec  .  Check 


108.  When  cheeking  a  generalization  G  of  concept  C, 

Specifically  test  to  ensure  that  G  is  not  equivalent  to  C. 

The  easiest  way  is  to  examine  the  non-examples  (especially  boundary  non-examples)  of  C, 
and  look  for  one  satisfying  G;  or  examine  the  examples  of  G  (esp.  boundary)  and  look 
for  one  failing  to  satisfy  C. 

If  they  appear  to  be  the  same  concept,  look  carefully  at  G.  Are  there  any  specializations  of  G 
whose  examples  have  never  been  filled  in?  If  so,  then  by  all  means  suggest  looking  for 
such  concepts*  examples  before  concluding  that  G  and  C  are  really  equivalent. 

If  they  are  the  same,  then  replace  one  by  a  conjecture. 

If  they  are  different,  make  sure  that  some  boundary  non-example  of  C  (which  is  an 
example  of  G)  is  explicitly  stored  for  C. 

This  rule  makes  sure  that  AM  is  not  deluding  itself.  When  AM  generalizes  Numbers-with- 
1-divisor  into  Numbers-which-equal-their-no-of-divisors,  it  still  hasn’t  gotten  past  the 
singleton  set  {!}.  The  conjecture  in  this  case  would  be  "The  only  number  which  equals  Us  own 
number  of  divisors  is  Typically,  when  a  generalization  G  of  C  turns  out  to  be  equivalent 
to  C,  there  is  theorem  lurking  around,  of  the  form  "All  G’s  also  .satisfy  this  property...", 
where  the  property  is  the  "extra"  constraint  present  in  C’s  definition  but  absent  from  G’s. 
This  rule  also  was  used  when  AM  had  Just  found  some  examples  of  Sets.  AM  almost 
believed  that  all  Unordered-Structures  were  also  Sets,  but  the  last  main  clause  of  the  rule 
had  AM  notice  that  Bags  is  a  specialization  of  Unordered-structures,  and  that  the  latter 
concept  had  never  had  any  of  its  examples  filled  in.  As  a  result,  AM  printed  out  this 
message:  "Almost  concluded  that  Unordered-structures  are  also  always  Sets.  But  will  wait 
until  examples  of  Bags  are  found.  Perhaps  some  Bags  will  not  be  Sets."  In  fact,  examples 
of  Bags  are  soon  found,  and  they  aren’t  sets. 

109.  Whtn  checking  •  specialization  S  of  concept  C, 

Specifically  test  to  ensure  that  $  is  not  equivalent  to  C. 

If  they  are  the  same,  than  replace  one  by  a  conjecture. 

If  they  are  different,  make  sura  that  some  boundary  example  of  C  (which  is  not  an 
example  of  S)  is  explicitly  stored  for  C. 
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This  rule  is  similar  to  the  preceding  one.  If  adding  a  new  constraint  P  to  the  definition 
doesn’t  change  the  concept  C,  then  there  is  probably  a  theorem  there  of  the  form  "All  C’s 
also  satisfy  constraint  P". 

1 10.  When  eheekini  a  ipeciilizalion  $  of  a  tpecielizition  X  of  a  concept  C, 
if  there  exist  other  specs,  of  specs,  of  C, 

then  ensure  that  none  of  them  are  the  same  as  S.  This  is  especially  worthwhile  if  the 
specializing  operators  in  each  case  were  the  same  but  reversed  in  order. 

Thus  we  can  add  a  constraint  to  C  and  collapse  the  first  two  arguments,  or  we  can  collapse 
the  arguments  and  add  the  constraint;  either  way,  we  get  to  the  same  very  specialized  new 
concept.  The  above  rule  helps  detect  those  accidental  duplicates.  £.g.,  Coalesced-Dom-Ran- 
Compositions  are  really  the  same  as  Dom-Ran-Coalesced-Compositions,  and  this  rule  would 
suspect  that  they  might  be. 

11  1.  When  chocking  the  GenI  or  Spec  facet  entries  for  concept  C, 

ensure  that  C.GenI  and  C.Spec  have  no  common  member  Z.  If  they  do,  then  conjecture  that  C 
and  Z  are  actually  equivalent. 

In  fact,  this  rule  actually  ensures  that  Ceneralizations(C)  does  not  intersect 
Speciaiizations(C).  If  it  does,  a  whole  'cycle*  of  concepts  exists  which  can  be  collapsed  into 
one  single  concept.  See  also  rule  114,  below. 


Any-concept .  Genl/Spec  ,  Interest 

112.  A  generalization  Of  X  is  Interesting  if  all  the  previously-known  boundary  non-oxamplet  are 

now  boundary  examples  of  the  concept. 

A  check  is  included  here  to  ensure  that  the  new  concept  was  not  simply  defined  as  the 
closure  of  the  old  one. 

113.  A  specialization  of  X  is  interesting  if  all  the  previously-known  boundary  examples  are  now 

boundary  non-examples  of  the  new  specialized  concept. 

A  check  is  included  here  to  ensure  that  the  new  concept  was  not  simply  defined  as  the 
interior  of  the  old  one. 

1 14.  If  Cl  is  a  generalization  of  C2,  which  is  a  generalization  of  C3,...,  which  is  a  generalization  of 

CJ,  and  it  has  just  been  learned  that  Cj  is  a  generalization  of  Cl, 

Then  all  the  concepts  Cl,...,Cj  are  equivalent,  and  can  be  merged,  and  the  combined  concept 
will  be  much  more  interesting  than  any  single  one,  and  the  interestingness  of  the  new 
composite  concept  increases  rapidly  with  j. 

The  Lisp  code  has  the  new  interest  value  be  computed  as  the  maximum  value  of  the  old 
concepts,  plus  a  bonus  which  increases  like  the  square  of  j.  This  is  similar  to  rule  number 
28.  A  rule  Just  like  the  preceding  one  exists,  with  'Specialization’  substituted  everywhere  for 
'Generalization*.  Thus  a  closed  loop  of  -Spec  links  constitutes  a  demonstration  that  all  the 
concepts  in  that  loop  are  equivalent.  These  rules  were  used  more  frequently  than  expected. 
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Appendix  3.2.6.  Heuristics  for  the  View  facet  of  Anv-concept 


Any-concept .  View  .  Fillin 

1 15.  To  fill  in  Vitw  facot  ontrios  for  X, 

Find  an  inttrostinf  optraiion  F  whota  rang*  is  X, 

and  indicata  that  any  mambar  of  Domain(F)  can  ba  viawad  as  an  X  just  by  running  F  on  it. 

While  trying  to  fill  in  the  View  facet  of  Even-nos,  AM  used  this  rule.  It  located  the 
operation  Doubling,  whose  domain  is  Numbers  and  whose  range  is  Even-nos.  Then  the  rule 
created  a  new  entry:  "to  view  any  number  as  if  it  were  an  even  number,  double  it".  This  is  a 
twisted  affirmation  of  the  standard  correspondence  between  natural  numbers  and  even 
natural  numbers. 


Appendix  3.2.7.  Heuristics  for  the  In-dom/ran-of  facets  of  Anv-concept 


Any-concept .  In-dom-of/In-ran-of  .  Fillin 

1 16.  To  fill  in  antrias  for  tha  In-dom-of  facet  of  concept  X, 

Ripple  down  tha  tree  of  concepts,  starting  at  Active,  to  empirically  determine  which  active 
concepts  can  be  run  on  X*s. 

This  can  usually  be  decided  by  inspecting  the  Domain/range  facets  of  the  Active  concepts. 
Occasionally,  AM  must  actually  try  to  run  an  active  on  sample  X*s,  to  see  whether  it  fails  or 
returns  a  value.*® 

1 17.  To  fill  in  tha  In-ran-of  facet  of  concept  X, 

Ripple  down  tha  tree  of  concepts,  starting  at  Active,  to  empirically  determine  which  active 
concepts  can  be  run  to  yield  X’s. 

This  can  usually  be  decided  by  inspecting  the  Domain/range  facets  of  the  Active  concepts. 
Occasionally,  AM  inspects  known  examples  of  some  Active  concept,  to  see  if  any  of  the 
results  are  X’s. 

1 18.  While  filling  in  entries  for  the  ln*dom>of  facet  of  X, 

Look  especially  carefully  for  Operations  which  transform  examples  and  non-examples  into 
each  other; 

.  This  is  even  better  if  the  operation  pushes  boundary  exs/non-exs  'across  the  boundary’. 

This  was  used  to  note  that  Insert  and  Delete  had  a  lot  to  do  with  the  concept  of  Singleton. 
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On*  k«y  fsilurt  of  Litp  which  pormit*  thi*  to  bo  dono  it  tho  Errortot  fooluro. 


App«ndix  3  AM  Oitcov«ry  in  M«th«rMtict  (t  H«uri(tic  3«arch  •251- 

Appendix  3.2.8.  Heuristics  for  the  Definition  facet  of  Any-concept 


Any-concept .  Defn  .  Suggest 

11 9.  If  th«r«  ar*  no  known  dofinitlons  for  cone# pt  X, 

Than  it  it  cruciti  that  AM  ipand  lomo  tima  lookinf  for  such  dafinitioni. 

This  situation  might  occur  if  only  an  Algorithm  is  present  for  some  concept.  In  that  case, 
the  above  rule  would  suggest  a  new,  high-priority  task,  and  AM  would  then  twist  the 
algorithm  into  a  (probably  very  inefficient)  definition.  A  much  more  serious  situation  would 
occur  if  a  concept  were  specified  only  by  its  Intuition  entries  (created,  e.g.,  by  modifying 
another  concept's  intuitions).  In  that  case,  rapidly  formulating  a  precise  definition  would  be 
a  necessity.  Of  course,  this  need  never  arose,  since  all  the  intuitions  were  deleted. 


Any-concept .  Defn  .  Check 


1 20.  When  checkint  the  Definition  facet  of  concept  C, 

Ensure  that  each  member  of  C.Exs  satisfies  all  definitione  present,  and  each  non-example  fails 
all  definitions.  If  there  is  one  dissenting  definition,  modify  it,  and  move  the  offending 
example  to  the  boundary. 

There  is  little  real  "checking"  that  can  be  done  to  a  definition,  aside  from  internal 
consistency:  If  there  exist  several  suposedly-equivalent  definitions,  then  AM  can  at  least 
ensure  they  agree  on  the  known  examples  and  non-examples  of  the  concept.  If  the  Intuitions 
facets  were  permitted,  then  each  definition  could  be  checked  for  intuitive  appeal. 

121.  Whtn  cheeking  ihe  Definition  facet  of  concept  C, 

Try  to  find  and  eliminate  any  redundant  constraints,  try  to  find  and  eliminate  any  circularity, 
check  that  any  recursion  will  terminate. 

Here  are  the  other  few  tricks  that  AM  knows  for  "checking"  a  definition.  For  each  clause  in 
the  rule  above,  AM  has  a  very  limited  ability  to  detect  and  patch  up  "bugs"  of  that  sort. 
Checking  that  recursion  will  terminate,  for  example,  is  done  by  examining  the  argument  to 
the  recursive  call,  and  verifying  that  it  contains  (at  some  level  before  the  original  argument) 
an  application  of  a  LISP  function  on  Destructive-LISP-functions-list.  There  is  no  intelligent 
inference  that  is  going  on  here,  and  for  that  reason  the  process  is  not  even  mentioned 
within  the  body  of  this  document 
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Appendix  3.3.  Heuristics  for  dealing  with  anv  Active  concept 

All  the  rules  below  are  applicable  to  tasks  which  involve  operations,  predicates,  relations, 
functions,  etc.  In  short,  they  apply  to  all  the  concepts  AM  knows  about  which  involve  doing 
something,  which  involve  action. 


Active  .  Fillin 


1 22.  If  tho  current  task  is  to  fill  in  examplss  of  tha  activity  F, 

One  way  to  c«t  them  is  to  run  F  on  randomly  chosen  examples  of  the  domain  of  F. 

Thus,  to  find  examples  of  Equality,  AM  repeatedly  executed  Equality.Alg  on  randomly 
chosen  pairs  of  objects.  AM  found  examples  of  Compositions  by  actually  picking  a  pair  of 
operations  at  random  and  trying  to  compose  them.  Of  course,  most  such  "unmotivated" 
compositions  turned  out  to  be  uninteresting. 

123.  While  filling  in  examples  of  the  activity  F,  by  running  F.AIgs  on  random  arguments  from 

F.Domain, 

It  is  worth  the  effort  to  specifically  include  extreme  or  boundary  examples  of  the  domain  of 
F,  among  the  arguments  on  which  F.AIgs  is  run. 


1 24.  To  fill  in  a  Domain  entry  for  active  concept  F, 

Run  F  on  various  entities,  rippling  down  the  tree  of  concepts,  to  determine  empirically  where 
F  seems  to  be  defined. 

This  may  shock  the  reader,  as  it  sounds  dumb  and  explosive,  but  the  concepts  are  arranged 
in  a  tree  (using  Cent  links),  so  the  search  is  really  quite  fast.  Although  this  rule  is  rarely 
used,  it  always  seems  to  give  surprisingly  good  results. 

1 25.  To  fill  in  generalizations  of  active  F, 

Consider  just  extending  F,  by  enlarging  its  domain.  Revise  F.Defn  as  little  as  pOi::>ible. 

Although  Equality  is  initially  only  for  structures,  AM  extends  it  (using  the  same  definition, 
actually)  to  a  predicate  over  all  pairs  of  entities. 

1 26.  To  fill  in  specializations  of  active  F, 

Consider  just  restricting  F,  by  shrinking  its  domain.  Check  F.Defn  to  see  if  some  optimization 
is  possible. 


127.  After  an  algorithm  is  known  for  F,  if  AM  wants  a  better  one, 

AM  is  permitted  to  ask  the  user  to  provide  a  fast  but  opaque  algorithm  for  F. 

This  was  used  a  few  times,  especially  for  inverse  functions.  A  nontrivial  open-ended 
research  problem  (>>)''*  is  to  collect  a  body  of  rules  which  transform  an  inefficient  algorithm 
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into  a  computationally  acceptable  one. 

1 28.  If  th«  currant  talk  is  to  fill  in  boundiry  (non*)«xamples  of  iho  activity  F, 

Ono  way  to  got  thorn  is  to  run  F  on  randomly  choion  boundary  oxamplos  and  (with  propor 
safoguards)  boundary  non-oxamplos  of  tho  domain  of  F. 

Proper  safeguards  are  required  to  ensure  that  FAIgs  doesn’t  loop  or  cause  an  error  when 
fed  a  slightly-wrong  (slightly-illegal)  argument.  In  LISP,  a  timer  and  an  ERRORSET 
suffice  as  crude  safeguards. 

129.  If  tho  currant  task  is  to  fill  In  (boundary)  non*axamplos  of  tho  activity  F, 

Ono  low-intorost  way  to  got  thorn  is  to  run  F  on  randomly  choson  oxamplos  of  its  domain, 
and  than  roplaco  tho  valuo  obtainod  by  soma  othor  (vary  simitar)  valuo.  Also,  bo  sura 
to  chock  that  tho  rosultant  i/o  pair  doosn't  aceidontally  satisfy  F.Dofn. 

The  parentheses  in  the  above  rule  mean  that  it  is  really  two  rules:  for  boundary  non¬ 
examples,  Just  change  the  final  value  slightly.  For  typical  non-examples,  change  the  result 
significantly.  If  you  read  the  words  inside  in  the  parentheses  in  the  IF  part,  then  read  the 
words  inside  the  parentheses  in  the  THEN  part  as  well,  or  omit  them  in  both  cases. 


Active  .  Check 

130.  Whan  chocking  an  algorithm  for  activo  F, 

run  that  algorithm  and  onsuro  that  tho  input/output  satisfy  F.Dofn. 


131.  Whan  chocking  a  dofinition  d  for  activo  concopt  F, 

Run  ono  of  its  algorithms  and  onsuro  that  iho  input/output  satisfy  d. 

This  is  the  converse  of  the  preceding  rule.  They  simply  say  that  the  definition  and 
algorithm  facets  must  be  mutually  consistent. 

132.  While  chocking  oxamplos  or  boundary  oxamplos  of  (ho  activo  concopt  F, 

Ensure  that  each  input/output  pair  is  consistent  with  F.0om/rango. 

If  the  domain/range  entry  is  <D1  D2...  Dk  •*  R>,  and  the  i/o  pair  is  <dj  d2...  ,  r>,  then 

each  component  dj  of  the  input  must  be  an  example  of  the  corresponding  Di,  and  the 
output  r  must  be  an  example  of  R. 

133.  Whan  chocking  oxamplos  of  tho  active  concept  F, 

If  any  argument(s)  to  F  were  concepts,  tag  (heir  ln>domain*of  facets  with  'F'. 

If  any  values  produced  by  F  are  concepts,  tag  thoir  ln*rango-of  facets  with  'F*. 

For  example,  Restrict(Unlon)  produced  Add,  at  one  time  in  AM’s  history.  Then  the  above 
rule  caused  ‘Restrict’  to  be  inserted  as  a  new  entry  on  Union.In-dom-of  and  also  on  Add.In- 
ran-of. 
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Active  .  Suggest 

1 34.  If  thor*  ar*  no  known  algorithms  for  activa  concapt  F, 

Than  AM  should  spend  soma  tima  looking  for  such  algorithms. 

This  situation  might  occur  if  only  a  Definition  is  present  for  some  operation.  In  that  case, 
the  above  rule  would  suggest  a  new,  high-priority  task,  and  AM  would  then  twist  the 
definition  into  a  (probably  very  inefficient)  algorithm.  The  rule  below  is  similar,  for  the 
Domain/range  facet: 

135.  If  the  Domain/ranga  faeat  of  activa  concapt  F  is  blank, 

Than  AM  should  spand  soma  tima  looking  for  spacifications  of  F's  domain  and  rsnga. 


136.  If  a  Domain  of  activa  concept  F  is  encountered  fraquantly,  either  within  conjectures  or  as  the 

domain  or  range  of  other  operations  and  predicates, 

Than  define  that  Domain  as  a  separata  concapt,  and  raise  the  Worth  of  F  slightly. 

The  ‘Domain*  here  refers  to  the  sequence  of  components,  whose  cartesian  product  is  what  is 
normally  referred  to  in  mathematics  as  the  domain  of  the  operation.  This  led  to  the 
definition  of  "An'^thing  x  Structures",  which  is  the  domain  of  several  Insertion  and  Deletion 
operations.  Membership  testing  predicates,  etc. 

137.  It  is  worthwhile  to  axplicitly  calculate  the  value  of  F  for  all  distinguished  (extrema,  boundary, 

interesting)  members  of  and  subsets  of  its  domain. 


138.  If  soma  domain  component  of  F  has  a  vary  intarasting  specialization. 

Than  consider  restricting  F  (along  that  component)  to  that  smaller  domain. 

Note  that  these  last  couple  rules  deal  with  the  image  of  interesting  domain  items.  The  next 
rule  deals  with  the  inverse  image  (pre-imag«.)  of  unusual  range  items.  We  saw  earlier  in  this 
document  (Chapter  2)  how  this  rule  led  to  the  definition  of  Prime  numbers. 

139.  If  tha  ranga  of  F  contains  intarasting  itams,  or  an  intarasting  spacialization. 

Than  it  is  worthwhila  to  consider  thair  inverse  image  under  F. 


140.  When  trying  to  fill  in  new  Algorithms  for  Activa  concept  F, 

Try  to  transform  any  conjectures  about  F  into  (pieces  of)  new  algorithms. 

This  is  one  place  where  a  sophisticated  body  of  transformation  rules  might  be  inserted. 
Others  are  working  on  this  problem  [Burstall  &  Darlington  75],  and  AM  only  contains  a 
few  simple  tricks  for  turning  conjectures  into  procedures.  For  example,  "All  primes  are  odd, 
except  is  transformed  into  a  more  eficient  search  for  primes:  a  separate  test  for  x-2, 
followed  by  a  search  through  only  Odd-numbers. 

141.  Aftar  trying  in  vain  to  fill  in  axamplas  of  activa  concapt  P, 

Locata  tha  domain  of  F,  and  suggast  that  AM  try  to  fill  in  axamplas  for  aach  component  of 
that  domain. 
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Thus  after  failing  to  find  examples  for  Set-union,  AM  was  told  to  find  examples  of  Sets, 
because  that  could  have  let  the  previous  task  succeed.  There  is  no  recursion  here:  after  the 
sets  are  found,  AM  will  not  automatically  go  back  to  finding  examples  of  Set-union.  In 
practice,  that  task  was  eventually  proposed  and  chosen  again,  and  succeeded  this  time. 

142.  After  working  on  an  Activa  concept  F, 

Give  a  slight,  ephemeral  boost  to  tasks  involving  Domain(F):  give  a  moderate  size  boost  to 
each  task  which  asks  to  fill  in  examples  of  that  domain/range  component,  and  give  a 
very  tiny  boost  to  each  ether  task  mentioning  such  a  concept. 

This  is  both  a  supplement  to  the  more  general  "focus  of  attention"  rule,  and  a  nontrivial 
heuristic  for  finding  valuable  new  tasks.  It  is  the  partial  converse  of  rule  14. 


Active  .  Interest 

143.  An  active  concept  F  is  interesting  if  there  are  other  operations  with  the  same  domain  as  F, 

and  if  they  are  (on  the  average)  fairly  interesting.  If  the  other  operations'  domain  is 
only  similar,  then  they  must  be  very  Interesting  and  have  somo  valuable  conjectures 
tied  to  them,  if  they  are  to  be  allowed  to  push  up  F's  interestingness  rating. 

The  value  of  having  the  same  domain/range  is  the  ability  to  compose  with  them.  If  the 
domain/range  is  only  similar,  then  AM  can  hope  for  analogies  or  for  partial  compositions. 

1 44.  An  active  concept  is  interesting  if  it  was  recently  crested. 

This  is  a  slight  extra  boost  given  to  each  new  operation,  predicate,  etc.  This  bonus  decays 
rapidly  with  time,  and  thus  so  will  the  overall  worth  of  the  concept,  unless  some  interesting 
property  is  encountered  quickly. 

1 45.  An  active  concept  is  interesting  if  its  domain  is  very  interesting. 

An  important  common  cue  of  this  rule  is  when  the  domain  is  interesting  because  ail  its 
members  are  equal  to  each  other.  The  corresponding  statement  about  rangts  does  exist,  but 
only  operations  can  be  said  to  have  a  specific  range  (not,  e.g.  Predicates).  Therefore,  the 
'range'  rule  is  listed  under  Operation.Interest,  as  rule  number  165. 
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Aopeiidix  3.4.  Heuristics  for  dealings  with  any  Predicate 

Each  of  these  heuristics  can  be  assumed  to  be  prefaced  by  a  clause  of  the  form  "If  the 
current  task  is  to  deal  with  concept  X,  where  X  isa  Predicate,...".  This  will  be  repeated 
below,  for  each  rule. 


Predicate  .  Fillin 

146.  If  the  current  task  was  (Fill'in  examples  of  X), 
and  X  it  a  predicate, 

and  more  than  100  items  are  known  in  the  domain  of 
and  at  least  10  epu  seconds  were  spent  trying  to  randomly  instantiate  X, 
and  the  ratio  of  succetset/failures  it  both  >0  and  last  than  .05 
Then  add  the  following  task  to  the  agenda:  (Fill-in  generalizations  of  X),  for  the  following  reason: 
"X  it  rarely  satisfied;  a  slightly  lets  restrictive  concept  might  be  more  interesting". 

This  reason's  rating  is  computed  as  three  times  the  ratio  of  nonexamples/examples  found. 

This  rule  says  to  generalize  a  predicate  if  it  rarely  succeeds  (returns  T).  One  use  for  this 
was  when  Equality  was  found  to  be  quite  rare;  the  resultant  generalizations  did  indeed  turn 
out  to  be  more  valuable  (numbers).  A  similar  use  was  found  for  predicates  which  tested  for 
identical  equality  of  two  angles,  of  two  triangles,  and  of  two  lines.  Their  generalizations 
were  also  valuable  (congruence,  similarity,  parallel,  equal-measure).  Most  rules  in  this 
appendix  are  not  presented  with  the  same  level  of  detail  as  the  preceding  one,  as  the  reader 
has  no  doubt  observed. 

147.  To  fill  in  Domain/rango  enirios  for  predicate  P, 

P  can  operate  on  the  domain  of  any  specialization  of  P, 

P  can  operate  on  any  specialization  of  the  domain  of  P, 

P  can  operate  on  some  restriction  of  the  domain  of  any  generalization  of  P, 

P  may  be  able  to  operate  on  some  enlargement  of  its  current  domain. 

The  range  of  P  will  necessarily  be  the  doubleton  set  {T,F}, 

P  is  guaranteed  return  T  if  any  of  its  specializations  do,  and  F  if  any  of  its  generalizations  do. 

This  contains  a  compiled  version  of  what  we  mean  when  we  say  that  one  predicate  is  a 
generalization  or  specialization  of  another.  Viewed  as  relations,  as  subsets  of  a  Cartesian- 
product  of  spaces,  this  notion  of  general/special  is  Just  that  of  superset/subset.  The  last  line 
of  the  rule  is  meant  to  indicate  that  adding  new  constraints  onto  P  can  only  make  it  return 
True  less  frequently,  while  relaxing  P’s  definition  can  only  make  it  return  True  more  often. 


Predicate  .  Suggest 

148.  If  all  the  values  of  Active  concept  F  happen  to  be  Truth-values,  and  F  is  not  known  to  be  a 
predicate. 

Then  conjecture  that  F  is  in  fact  a  predicate. 

This  rule  is  placed  on  the  Suggest  facet  because,  if  placed  anywhere  else  on  this  concept,  it 
could  only  be  seen  as  relevant  by  AM  if  AM  already  knew  that  F  were  a  predeicate.  On 
the  other  hand,  the  rule  can’t  be  placed,  e.g.,  on  Active.Fillin,  since  just  forgetting  (deleting) 
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this  "Predicate"  concept  should  be  enough  to  delete  all  references  to  predicates  anywhere  in 
the  system. 


Predicate  .  Interest 


1 49.  A  predicate  P  it  interestins  if  its  domain  it  Any-concept  (the  space  of  all  Known  concepts). 
This  It  especially  true  if  there  it  a  sifnificant  positive  correlation  (theoretical  or 
empirical)  between  concepts'  worths  and  their  P'Values. 

This  very  high  level  heuristic  wasn’t  really  used  by  AM  during  its  runs. 
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Operation  .  Fillin 

150.  To  fill  in  oxamplos  of  oporation  F  (with  dotrain  A  and  rans*  B), 
whon  many  oxamplas  ed  of  A  art  alraady  known, 

and  F  maps  somo  of  those  examples  ec  into  distintuished  members  (esp:  extrema)  b  of  B, 

Then  (for  each  such  distintuished  member  'V^B)  study  F''(b)  as  a  new  concept.  That  is, 
isolate  those  members  of  A  whose  F*vslue  is  the  unusual  item  b<B. 

This  rule  says  to  investigate  the  inverse  image  of  an  unusual  item  b,  under  the  interesting 
operation  f.  When  b»2  and  f»number-of-divisors-of,  this  rule  leads  to  the  definition  of 
prime  numbers.  When  b-Phi*®  and  f-Intersection,  the  rule  led  to  the  discovery  of  the 
concept  of  disjointness  of  sets. 

151.  To  fill  in  Domain/ranga  antrias  for  oparation  F, 

F  can  oparata  on  tha  domain  of  any  spacialization  of  F, 

F  can  oparata  on  tha  spacialization  of  tha  domain  of  any  spacialization  of  F  (including  F 
itself), 

F  can  operate  on  soma  restriction  of  the  domain  of  any  generalization  of  F,  at  least  on  its 
currant  domain  and  perhaps  even  on  a  bigger  space, 

F  may  be  able  to  oparata  on  come  generalization  of  (soma  eomponant(s)  of)  its  currant 
domain, 

F  can  only  (and  will  always)  produce  values  lying  in  tha  range  of  each  generalization  of  F, 

F  can  with  tha  proper  arguments  produce  vaiuas  lying  in  tha  range  of  any  particuiar 
speciaiization  of  F. 

There  are  only  a  few  changes  between  this  rule  and  the  corresponding  one  for  Predicates. 
Recall  that  Operations  can  be  multi-valued,  and  those  values  are  not  limited  to  the  set 
{T,F}. 

152.  To  fill  in  Domain/range  entries  for  oparation  F,  whan  soma  exist  alraady. 

Taka  an  entry  of  tha  form  <D1  D2...  On  -*  R>  and  sea  if  DixR  is  meaningful  for  some,! 
(especially:  i-n). 

If  so,  than  remove  Di  from  the  left  side  of  the  entry,  and  replace  R  by  DixR,  and  modify  the 
definition  of  F. 

In  LISP,  "meaningful"  is  coded  as:  either  DixR  is  equivalent  to  an  already-known  concept, 
or  else  it  is  found  in  at  least  two  interesting  conjecture.'.  This  is  probably  an  instance  of 
what  McDermott  calls  natural  stupidity'*.  This  rule  is  tagged  as  being  explosive,  and  is  not 
used  very  often  by  AM. 

153.  To  fill  in  a  Range  entry  for  operation  F, 

Run  F  on  various  domain  examples,  especially  boundary  examples,  to  collect  examples  of  the 
range.  Than  ripple  down  tha  tree  of  concepts  to  determine  empirically  where  F  seams 
to  be  sending  its  vaiuas. 

1h«  tmpty  t«t,  NIL,  {} 

'*  Sm  [McDtrmott  76]  for  Mtur«l  stupidity.  Ht  critieizss  th«  uso  of  vary  inUlli(snt-soundin|  narnts  for  othorwiss-simplo 
profram  modulss.  But  coosklor  ‘Home  sspisns*,  which  moans  "wisa  man*.  Now  thsra’s  a  mislaading  labol... 
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This  may  shock  the  reader,  as  it  sounds  dumb  and  explosive,  but  the  concepts  are  arranged 
in  a  tree  (using  Cent  links),  so  the  search  is  really  quite  fast.  Although  this  rule  is  rarely 
used,  it  always  seems  to  give  surprisingly  good  results. 

1 54.  If  operation  F  hat  fu$t  been  applied,  end  haa  yielded  a  new  concept  C  as  itt  result, 

Then  carefully  examine  F.Dom/ran|e  to  try  to  find  out  what  C.ita  should  be.  C.lta  will  be  all 
legal  entriea  listed  at  values  of  the  range  of  F. 

When  F-Compose,  say  AM  has  just  created  C-EmptyoIn$ert.*^  What  is  C?  It  is  a  concept, 
of  course,  but  what  else?  By  examining  the  Domain/range  facet  of  Compose,  AM  finds  the 
entry  <Active  Active  -♦  Active>.  Aha!  So  C  must  be  an  Active.  But  AM  also  finds  the  entry 
<Predicate  Active  •*  Predicate>.  Since  "Empty”  is  a  predicate,  the  final  composition  C  must 
also  be  a  predicate.  So  C.Isa  would  be  filled  in  with  "Predicate".  AM  thus  used  the  above 
rule  to  determine  that  Emptyolnsett  was  a  predicate.  Even  if  this  rule  were  excised,  AM 
could  still  determine  that  fact,  painfull),  by  noticing  that  all  the  values  were  truth-values. 

155.  If  oporalion  F  hit  just  boon  applioo  to  A1,A2,...,  and  has  yioldod  a  now  concopt  C  as  ito 

rosult, 

Than  add  F  to  C.ln-ran’of;  add  F  to  tho  ln>dom*of  facot  of  all  tho  Ai*s  which  ar#  concepts; 
add  <A1...  C>  to  F.Exb. 

There  is  some  overlap  here  with  earlier  rules,  but  there  is  no  theoretical  or  practical 
difficulty  with  such  redundancy. 

156.  Whon  filling  in  oxamplot  of  operation  F,  if  F  takes  some  existing  coneepti  Al,  A2,...  and 

(may)  produce  a  new  concept, 

Then  only  coniider,  aa  potential  A|’s,  those  concepts  which  already  have  some  examples. 
Prefer  the  A|’s  to  be  interesting,  to  have  a  high  worth  rating,  to  have  some  interesting 
conjectures  about  them,  to  have  several  examples  and  several  non'’examples,  etc. 

The  danger  here  is  of,  e.g..  Composing  two  operations  which  turn  out  to  be  vacuous,  or  of 
Conjoining  an  empty  concept  onto  another,  or  of  proliferating  variants  of  a  boring  concept, 
etc. 


Operation  ,  Check 

Below  are  rules  used  to  check  existing  entries  on  various  facets  of  operations. 


/ 
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157.  To  chock  tho  domain/ringo  ontrios  on  tho  oporation  F, 

IF  a  domain/rango  ontry  has  tho  form  (D  D  0...  ->  R), 

and  all  tho  D's  aro  oqual,  and  R  it  a  gonoraliiation  of  D  (or,  with  lost  onthuaiaam:  if  R  and  D 
havo  a  significant  ovorlap), 

THEN  it’s  worth  sooing  whothor  (D  D  D...  -*  D)  it  eonaiatont  with  all  known  oxamplot  of  tho 
oporation: 

If  thoro  aro  no  known  oxamplot,  add  a  taak  to  tho  aganda  roquoating  thoy  bo  fillod 
in. 

If  thoro  aro  oxamplot,  and  (D  D  D...  -»  D)  it  eonaiatont,  add  it  to  tho  Domain/rango 
faeot  of  this  oporation. 

If  thoro  aro  aomo  contradicting  oxamplot,  croato  a  now  concopt  which  it  dofinod 
at  this  oporation  rostrictod  to  (D  D  D...  D). 

When  AM  restricts  Bag-union  to  numbers  (bags  of  T’s),  the  new  operation  has  a 
Domain/range  entry  of  the  form  (Numbers  Numbers  -»  Bag).  The  above  rule  has  AM 
investigate  whether  the  range  specification  mightn’t  also  be  narrowed  down  to  Number.  In 
this  case  it  is  a  great  help.  The  rule  often  fails,  of  course:  the  sum  of  two  primes  is  rarely  a 
prime,  the  cross-product  of  two  lists-of-atoms  is  not  a  list-of-atoms,  etc.  Since  this  rule  is 
almost  instantaneous  to  execute,  it’s  cost-effective  overall. 

158.  When  checking  fhe  domain/range  entries  on  the  operation  F, 

IF  a  domain/range  entry  has  the  form  (D  D  D...  *4  R), 

and  all  the  D's  are  equal,  and  R  is  a  specialization  of  D, 

THEN  it’s  worth  inserting  (0  0  0...  -*  D)  as  a  new  entry  on  F.Dom/ran,  even  though  that  is 
redundant. 

This  shows  that  symmetry  and  aesthetics  are  sometimes  preferable  to  absolute  optimization. 
That’s  why  we  program  in  Lisp,  instead  of  machine  language.  On  the  other  hand,  this  rule 
wasn’t  really  that  useful  to  AM.  Now,  by  analogy,...? 

159.  When  checking  the  Domain/range  entries  for  operation  F, 

Ensure  that  the  boundary  items  in  the  range  can  actually  be  reached  by  F.  If  not,  see 
whether  the  range  is  really  just  some  known  specialization  of  F. 

This  rule  is  a  typical  checking  rule.  Note  that  it  is  active,  not  passive:  it  might  alter  the 
Domain/range  facet  of  F,  it  it  finds  an  error  there. 

160.  When  cheeking  examples  of  the  operation  F,  for  each  such  example. 

If  the  value  returned  by  F  is  a  concept  C,  add  T’  to  C.ln*range-of. 


Operation  .  Suggest 


161.  Whenever  the  domain  of  operation  F  has  changed, 

check  whether  the  range  has  also  changed.  Often  the  range  will  change  analogously  to  the 
domain,  where  the  operation  itself  is  the  Analogy. 

162.  After  working  on  Operation  F, 

Give  a  slight,  ephemeral  boost  to  tasks  involving  Range(F). 
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This  wit  be  a  moderate  size  boost  for  each  task  which  asks  to  Ail  in  examples  of  that  range 
concept,  and  a  very  tiny  boost  for  each  other  task  mentioning  such  a  concept.  This  is  both 
a  supplement  to  the  more  general  "focus  of  attention"  rule,  and  a  nontrivial  heuristic  for 
finding  valuable  new  tasks.  It  is  an  extension  of  rule  number  H2,  and  a  partial  converse  to 
rule  H. 


Operation  .  Interest 

163.  An  operation  F  it  interottini  if  there  are  ether  eperatiom  with  the  tame  domain  and  ranfo, 
and  if  they  are  (on  the  average)  fairly  intereating. 


164.  An  operation  F  ia  intereating  if  it  ia  the  firat  operation  conneeting  ita  domain  concept  to  ita 

range  concept,  and  if  thoae  domain/range  componenta  are  themaeivea  valuable 
concepta,  and  there  ia  no  analogy  between  them,  and  there  are  tome  intereating 
conjecturea  involving  the  domain  of  F. 

The  above  two  rules  say  that  F  can  be  valuable  becuase  it*i  similar  to  other,  already-liked 
operations,  or  because  it  is  totally  different  from  any  known  operation.  Although  these  two 
criteria  are  nonintersecting,  their  union  represenu  only  a  small  fraction  of  the  operations 
that  get  created:  typically,  ntithtr  rule  will  trigger. 

165.  An  operation  F  it  interesting  if  its  range  it  very  interesting. 

Range  here  refers  to  the  concept  in  which  all  results  of  F  must  lie.  It  is  the  R  in  the 
domain/range  facet  entry  <D  R>  for  concept  F.  The  corresponding  rule  for  'domains'  is 
applicable  to  any  Active,  not  Just  to  Operations,  hence  is  listed  under  Activelnterest,  as  rule 
number  145. 

166.  An  operation  F  is  interosting  if  the  values  of  F  satisfy  tome  unusual  property  which  ia  not  (in 

general)  satisfied  by  the  arguments  to  F. 

Thus  doubling  is  interesting  because  it  always  returns  an  even  number.  This  is  one  case 
where  the  interesting  property  can  be  deduct  trivially  Just  by  looking  at  the  domain  and 
range  of  the  operation:  Numbers>*£ven-nos. 

167.  An  operation  is  interesting  if  its  values  are  interesting. 

This  can  mean  that  each  value  is  interesting  (e.g..  Compose  is  well-received  because  it 
produc.;S  many  new,  valuable  concepts  as  its  values).  Or,  it  can  mean  that  the  operations' 
values,  gathered  together  into  one  big  set,  are  interesting  as  a  set.  Unlike  the  preceding  rule, 
this  one  has  no  mention  whatsoever  of  the  domain  items,  the  arguments  to  the  operation. 
This  rule  was  used  to  good  advantage  frequently  by  AM.  For  example.  Factorings  of 
numbers  are  interesting  because  (using  rule  232)  for  ail  x,  Factorings(x)  is  interesting  in 
exactly  the  same  way.  Namely,  Factorings(x),  viewed  as  a  set,  always  contains  precisely  one 
item  which  has  a  certain  interesting  property  (see  rule  233).  Namely,  ail  its  members  are 
primes  (see  rule  232  again).  This  explains  one  way  in  which  AM  noticed  that  ail  numbers 
seem  to  factor  uniquely  into  primes. 
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168.  An  operation  is  interesting  if  its  values  are  interesting,  ignoring  the  images  of  boundary 

items  from  the  domain. 

That  is,  if  the  image  of  the  domain  -  minus  its  boundary  -  is  interesting. 

169.  An  operation  is  interesting  if  its  values  on  the  boundary  items  from  the  domain  are  vary 

interesting.  Ignore  the  non'boundary  parts  of  the  domain. 

That  is,  if  the  image  of  the  boundary  of  the  domain  is  interesting. 

170.  An  operation  is  interesting  if  it  leaves  intact  any  interesting  properties  of  its  argumentfs). 

This  is  oven  better  if  it  eliminates  some  undesirable  properties,  or  adds  soma  new, 
desirable  ones. 

Thus  a  new,  specialized  kind  of  Insertion  operation  is  interesting  if,  even  though  it  stuffs 
more  items  into  a  structure,  the  nice  properties  of  the  structure  remain.  The  operation 
"Merge"  is  interesting  for  this  very  reason:  it  inserts  items  into  an  alphabetized  list,  yet  it 
doesn’t  destroy  that  interesting  property  of  the  list. 

171.  An  operation  is  interesting  if  its  domain  and  range  are  equal.  If  there  is  more  than  one 

domain  component,  then  at  least  one  of  them  should  equal  the  range.  The  more 
components  which  are  equal  to  the  range,  the  bettor. 

Thus  "Insertion"  qualifies  here,  since  its  domain/range  entry  is  < Anything  Structures  -+ 
Structures>.  But  "Union"  is  even  better,  since  both  domain  components  equal  the  range, 
namely  Structures. 

172.  An  operation  is  mildly  interesting  if  its  range  is  related  somehow  (e.g.  specialization  of)  to 

one  or  more  components  of  its  range.  The  more  the  better. 

A  weakened  form  of  the  preceding  rule. 

173.  If  the  result  of  applying  operation  F.  is  a  new  concept  C, 

Then  the  interestingness  of  F  is  weakly  tied  to  that  of  C. 

If  the  new  concept  C  becomes  very  valuable,  then  F  will  rise  slightly  in  interest.  If  C  is  so 
bad  it  gets  forgotten,  F  will  not  be  regarded  quite  as  highly.  When  Canonize  scores  big  its 
first  time  used,  it  rises  in  interest.  This  caused  AM  to  form  poorly-motivated  canonizations, 
which  led  to  dismal  results,  which  gradually  lowered  the  rating  of  Canonize  to  where  it  was 
originally. 
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Appendix  S.6.  Heuristics  for  dealing  with  anv  Composition 


Composition  .  Fillin 


174.  To  fill  in  iltorithms  for  operation  F,  whoro  F  is  ■  composition  GoH, 

One  alcorithm  is:  apply  H  and  than  apply  G  to  the  result. 

Of  course  this  rule  is  not  much  more  than  the  definition  of  what  it  means  to  compose  two 
operations. 

175.  To  fill  in  Domain/range  entries  for  operation  F,  where  F  is  a  composition  GoH, 

Tentatively  assume  that  the  domain  is  Oomain(H),  and  range  is  Rsnge(G).  More  precisely,  the 

domain  will  be  the  result  of  substituting  Domain(H)  for  Range(H)  wherever  Range(H) 
appears  (or:  just  once)  in  Domain(G). 

Thus  for  F-DividesoCount,  where  Divides:<Number, Number  -»  {T,F}>,  and  Count:<Bag  -♦ 
Number>,  the  above  rule  would  say  that  the  domain/range  entries  for  F  are  gotten  by 
substituting  'Bag’  for  'Number’  once  or  twice  in  Domain(Divides).  The  possible  entries  for 
F.Dom/range  are  thus:  <Bag,Bag  -♦  {T,F}>,  <Number,Bag  -*  {T,F}>,  and  <Bag, Number  -♦ 
{T,F}>. 

176.  To  fill  in  Domain/range  entries  for  operation  F,  where  F  is  a  composition  GoH,  But  RangefH) 

does  not  occur  as  a  component  of  Domain(G), 

The  range  of  F  is  still  Rsnge(G),  but  the  domain  of  F  is  computed  as  follows:  Ascertain  the 
component  X  of  Domain(G)  having  the  biggest  (fractional)  overlap  with  Range(H).  Then 
substitute  Domsin(H)  for  X  in  Oomsin(G).  The  result  is  the  value  to  be  used  for 
Oomain(F). 

This  rule  is  a  second-order  correction  to  the  previous  one.  If  there  is  no  absolute  equality, 
then  a  large  intersection  will  suffice.  Notice  that  F  may  no  longer  be  defined  on  all  of  its 
domain,  even  if  G  and  H  are.  If  identical  equality  is  taken  as  the  maximum  possible 
overlap  betwen  two  concepts,  then  this  rule  can  be  used  to  replace  the  preceding  one 
completely. 

177.  Whon  trying  to  fill  in  tho  Isa  ontrias  for  a  composition  FafSoH, 

Examine  G.lsa  and  H.lsa,  and  tspocially  their  intersection.  Some  of  those  concepts  may  also 
claim  F  as  an  example.  Run  their  definition  facet  to  see. 

To  see  how  this  is  encoded  into  LISP,  turn  to  page  219. 

178.  When  trying  to  fill  in  the  GenI  or  Spec  entries  for  a  composition  F^GoH, 

Examine  the  corresponding  facet  on  G  and  on  H. 

This  rule  is  similar  to  the  preceding  one,  but  wasn’t  as  useful  or  as  reliable. 


179.  A  satisfactory  initial  guess  si  the  Worth  value  of  composition  F>GoH  is  the  root-sum^of* 
squares  of  G.Worth  and  H.Worth. 
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180.  To  fill  in  oxampltt  of  F,  wharo  F>GoH,  and  both  G  and  H  aro  time-consuminc,  but  whoro 

many  axamplas  of  both  G  and  H  oxist, 

SooK  an  oxamplo  x-»y  of  H,  and  an  oxamplo  y*^z  of  G,  and  th«n  roturn  x-»z  at  a  probablo 
•xampio  of  F. 

Above,  ‘seek’  is  done  in  a  tight,  cfficent  manner.  The  examples  are  H  are  hashed  into  an 
array,  based  on  the  values  y  of  each  one.  Then  the  arguments  of  the  examples  of  G  are 
hashed  to  see  if  they  occur  in  this  array.  Those  that  do  will  generate  an  example  of  the  new 
composition. 

181.  To  fill  in  oxamplti  of  F,  whtrt  F«GoH,  and  G  it  timocontuming,  but  many  oxamplot  of  G  oxitt, 

and  it  is  not  known  whothtr  H  it  timo-eonauming  or  not, 

Spond  a  moment  trying  to  accost  or  trivially  fill  in  oxamplot  of  H. 

If  this  tuccoodt,  apply  tho  procoding  rulo. 

H  this  fails,  thon  formally  propoto  that  AM  fill  in  oxamplos  of  H,  with  priority  oqual  to  that  of 
tho  curront  task,  for  thoso  two  roasont:  (i)  if  oxamplos  of  H  oxittod,  thon  AM  could 
havo  usod  tho  houristic  procoding  this  ono,  to  fill  in  oxamplot  of  F,  and  (ii)  It  is 
dangorout  to  tpond  a  long  timo  doaling  with  GoH  boforo  any  oxamplot  at  all  of  H  aro 
known. 

This  rule  is  of  course  tightly  coupled  to  the  preceding  one.  The  same  rule  exists  for  the 
case  where  just  H  is  time-consuming,  instead  of  G. 

182.  Whon  trying  to  fill  in  Conjocs  about  a  composition  F«GeH, 

Consider  that  F  may  bo  tho  samo  as  G  (or  tho  tamo  as  H). 

It  was  somewhat  depressing  that  this  'stupid'  heuristic  turned  out  to  be  valuable,  perhaps 
even  necessary  for  AM’s  top  performance. 


Composition  .  Check 


183.  Chock  that  FoG  is  roally  not  tho  samo  as  F,  or  tho  samo  as  G.  Spond  torn#  (into  chocking 

whothor  FoG  it  oquivalont  to  any  alroadyknown  activo  concept. 

This  happens  often  enough  to  make  it  worth  stating  explicitly.  Often,  for  example,  F  will 
not  even  bother  looking  at  the  result  of  G!  For  example, 

ProJ2oSquare(x,y)  -  ProJ2(Squarc(x),y)  -  y  -  ProJ2(x,y). 

184.  Whon  chocking  tho  Algorithms  entries  for  a  composition  FoGoH, 

If  rango(H)  is  not  wholly  contained  in  tho  domain  of  G, 

thon  tho  algorithm  mutt  contain  a  "legality"  check,  ensuring  that  H(x)  is  a  valid  member  of 
tho  domain  of  G. 
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Coraposition  .  Suggest 

185.  Givan  an  intaraating  oparaiion 
conaidar  compoting  F  with  ittalf. 

This  may  result  in  more  than  one  new  operation.  From  F-division,  for  example,  we  get  the 
two  operations  (x/y)/z  and  x/(y/z).  AM  quickly  realizes  that  such  variants  are  really 
equivalent,  and  (if  prodded)  eventually  realizes  that  F(F(x,y),z)»F(x,F(y,z))  is  a  common 
situation  (which  we  call  associativity  of  F). 

186.  If  tha  nawly-formad  domain  of  tha  composition  FbGoH  contains  mora  than  ona  occurranca  of 

tha  concapt  D,  and  this  isn't  trua  of  G  or  H, 

Than  considar  craating  a  naw  oparation,  a  spacialization  of  F,  by  Coalascing  tha 
domain/ranga  of  F,  by  aliminating  ona  of  tha  D  componants. 

Thus  when  InsertoDelete  is  formed,  the  old  Domain/range  entries  were  both  of  the  form 
<Anything  Structure  -*  Structure>.  The  newly-created  entry  for  InsertoDelete  was  <Anything 
Anything  Structure  •*  Structure>;  i.e.,  take  x,  delete  it  from  S,  then  insert  y  into  S.  The 
above  rule  had  AM  turn  this  into  a  new  operation,  with  domain/range  <Anything  Structure 
-*  Structure>,  which  deleted  x  from  S  and  the  inserted  the  very  same  x  back  into  S. 


Composition  ,  Interest 

187.  A  composition  FaGoH  is  intarasting  if  G  and  H  ara  vary  intaraating. 


188.  A  composition  FaGoH  is  intarasting  if  F  has  an  intarasting  proparty  not  possassad  by  aithar 

Gor  H. 

189.  A  composition  FaGoH  is  intarasting  if  F  Un  most  of  tha  intarasting  propartias  which  ara 

possassad  by  aithar  G  or  H.  This  is  slightly  raducad  if  both  G  and  H  possass  tha 
proparty. 

1 90.  A  composition  FaGoH  is  intarasting  if  F  lacks  any  undasirabla  propartias  trua  of  G  or  H.  This 

is  grastly  incraasad  if  both  G  and  H  possass  tha  bad  proparty,  unlass  G  and  H  ara  vary 
closaly  ralatad  to  aach  othar  (a.g.,  HaG,or  HaG'*). 

The  numeric  impact  of  each  of  these  rules  was  guessed  at  initially,  and  has  never  needed 
tuning.  Here  is  an  area  where  experimentation  might  prove  interesting. 

191.  A  composition  FaGoH  is  intarasting  if  F  maps  intarasting  subsats  of  domain(H)  into  intarasting 

subsats  of  ranga(G). 

F  is  to  ba  judgad  avan  mora  intarasting  if  tha  imaga  was  not  thought  to  ba  intarasting  until 
aftar  it  was  axplicitly  isolatad  and  studiad  baeausa  of  part  1  of  this  vary  rula. 

Here,  an  "interesting"  subset  of  domain(H)  is  one  so  judged  by  Interests(domain(H)).  A 
completely  different  set  of  criteria  will  be  used  to  Judge  the  interestingness  of  the  resultant 
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image  under  F.  Namely,  for  that  purpose,  AM  will  ask  for  range(G).Interest,  and  ripple 
outwards  to  look  for  related  interest  features. 

192.  A  composition  F*GqH  is  intorsstinf  if  F'*  maps  ini«r«tiin{  tubsoit  of  rang«(G)  into 

intoresting  subsets  of  domain(F). 

This  is  even  better  if  the  preimsge  wasn’t  hitherto  realized  as  interesting. 

This  is  the  converse  of  the  preceding  rule.  Again,  "interesting"  is  judged  by  two  different 
sets  of  criteria. 

193.  A  composition  F«GoH  is  interesting  if  F  maps  interesting  elements  of  clomain(H)  into 

interesting  subsets  of  range(G). 


194.  A  composition  F^GoH  is  interesting  if  F‘*  maps  interesting  elements  of  range(G)  into 

interesting  subsets  of  domain(F). 

This  is  even  better  if  the  subset  is  only  now  seen  to  be  interesting. 

This  is  the  analogue  of  an  earlier  rule,  but  for  Individual  items  rather  than  for  whole 
subsets  pf  the  domain  and  range  of  F. 

1 95.  A  composition  F«GeH  is  interesting  if  range(H)  is  equal  to,  not  just  intersects,  one  component 

of  domsin(G). 


1 96.  A  composition  F«GoH  is  mildly  interesting  if  range(H)  is  a  speeislizetion  of  one  component  of 

domain(G). 

This  is  a  weakened  version  of  the  preceding  feature.  Such  a  composition  is  interesting 
because  it  is  guaranteed  to  always  be  applicable.  If  Range(H)  merely  intersects  a  domain 
component  of  G,  then  there  must  be  an  extra  check,  after  computing  H(x),  to  ensure  it  lies 
within  the  legal  domain  of  G,  before  trying  to  run  G  on  that  new  entity  H(x). 

1 97.  A  composition  F>GoH  is  mors  intsrssting  if  rangs(G)  is  equal  to  a  domain  component  of  H. 

This  is  over  and  above  the  slight  boost  given  to  the  composition  because  it  is  an  operation 
whose  domain  and  range  coincide  (see  rule  171). 

Appendix  3.7.  Heuristics  for  dealing  with  anv  Insertions 


Insertion  .  Check 

1 98.  When  checking  an  example  of  any  kind  of  insertion  of  x  into  S, 

Ensure  that  x  is  a  memlMr  of  S. 

The  only  types  of  insertions  known  to  AM  are  unconditional  insertions,  so  this  rule  is  valid. 
It  is  useful  for  ensuring  that  a  particular  new  operation  really  is  an  insertion-operation  after 
ail! 
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Coalesce  .  Fillin 

199.  When  coalatcini  F(a,b,c,...),  whoat  domain/ranf*  ia  <A  B  C...  R>, 

A  (ood  choice  of  two  domain  componenla  to  coaleace  ia  a  pair  of  identically  equal  onea. 
Barring  that,  chooae  a  pair  related  by  apaclallzatlon  (eliminate  the  more  general  one). 
Barring  that,  chooae  a  pair  with  a  common  apecialization  S,  and  replace  both  by  S. 

Thus  to  coalesce  the  operation  "InsertoDelete"  [which  takes  two  items  and  a  structure,  deletes 
the  first  argument  from  the  structure  and  then  inserts  the  second  argument],  AM  examines 
its  Domain/range  entry:  <Anything  Anything  Structure  -» Stracture>.  Although  it  would  be 
legal  to  collapse  the  second  and  third  arguments,  the  above  rule  says  it  makes  more  sense  in 
general  to  ccUapse  the  first  and  second.  In  fact,  in  that  case,  AM  gets  an  operation  which 
tells  it  something  about  multiple  elements  structures. 

200.  Whon  filling  in  Algorithms  for  •  coaloictd  variion  G  of  setivo  concopt  F, 

Ono  natural  algorithm  ia  limpiy  to  call  on  FAIga,  with  two  argumonts  the  samo. 

Of  course  the  two  identical  arguments  are  those  which  have  been  decided  to  be  merged. 
This  will  be  decided  before  the  definition  and  algorithm  facets  are  filled  in.  Thus  a  natural 
algorithm  for  Square  is  to  call  on  TIM£S.AIg(x,x).  The  following  rule  is  similar: 

201.  Wh«n  filling  in  Oofinitiona  for  a  coalticad  version  G  of  activ*  concept  F, 

One  natural  Definition  is  simply  to  call  on  F.0efn,  with  two  arguments  the  same. 

202.  When  filling  in  the  Worth  of  a  new  coalesced  version  of  F, 

A  suitable  value  is  0.9x(Worth  of  F)  *  O.lxlWorth  of  Coalesce). 

This  is  a  compromise  between  (i)  the  knowledge  that  the  new  operation  will  probably  be 
less  interesting  than  F,  and  (ii)  the  knowledge  that  it  may  lead  to  even  more  valuable  new 
concepts  (c.g.,  its  Inverse  may  be  more  interesting  than  F’s).  The  formula  also  incorporates 
a  small  factor  which  is  based  on  the  overall  value  of  coalescings  which  AM  has  done  so  far 
in  the  run. 


Coalesce  .  Check 

203.  If  G  and  H  art  oach  two  coalescings  sway  from  F,  for  any  F, 

Than  chock  that  G  and  H  aren't  really  the  same,  by  writing  their  definitions  out  in  terms  of 
F.Defn. 

Thus  if  R(a,b,c)  is  really  F(a,b,a,c),  and  S(.\,b,c)  is  really  F{a,b,c,c),  and  R  and  S  get  coalesced 
again,  into  G(a,b}  whch  is  R(aib,a)  and  into  H(a,b}  which  is  S(a,b,a),  then  both  G  and  H  are 
really  F(a,b,a,a).  The  order  of  coalescing  is  u  ‘mportant.  This  is  a  boost  to  the  more 
general  Impetus  for  checking  this  sort  of  thing,  rule  110.  This  rule  is  faster,  containing  a 
special-purpose  program  for  untangling  argument-calls  rapidly.  If  the  concept  of  Coalesce  is 
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excised  from  the  system,  one  can  easily  Imagine  it  being  re-derived  by  a  more  general 
'coincidence'  strategy,  but  how  will  these  specific,  high-powered,  tightly-coded  heuristics  ever 
get  discovered  and  tacked  onto  the  Coalesce  concept?  This  is  an  instance  of  the  main  meta¬ 
level  research  problem  proposed  earlier  in  the  thesis  (Chapter  7). 


Gioalesce  .  Suggest 


204.  If  a  newlyinterettinc  aciive  concept  F(x,y)  takes  a  pair  of  N*a  as  arguments, 

Then  create  a  new  concept,  a  specialization  of  F,  called  F-ltself,  taking  just  one  N  as 
argument,  defined  as  F(x,x),  with  initial  worth  Worth(F). 

If  AM  has  never  coalesced  F  before,  this  gets  a  slight  bonus  value. 

If  AM  has  coalesced  F  before,  say  into  $,  then  modify  this  suggestion's  value  according  to  the 
current  worth  of  S. 

The  iower  the  system's  interestMhreshhold  is,  the  nwre  attactive  this  suggestion  becomes. 

AM  used  this  rule  to  coalesce  many  active  concepts.  Times(x,x)  is  what  we  know  as 
squaring;  £quality(x,x)  turned  out  to  be  the  constant  predicate  True;  Intersect(x,x)  turned 
out  to  be  the  identity  operator;  Compose(f,f)  was  an  interesting  "iteration"  operator etc. 
This  tale  is  really  a  bundle  of  little  meta-rules  modifying  one  suggestion:  the  suggestion  that 
AM  coalesce  F.  The  very  last  part  of  the  above  rule  indicates  that  if  the  system  is 
desparate,  this  is  the  least  distasteful  way  to  "take  a  chance"  on  a  high-payoff  high-risk 
course  of  action.  It  is  more  sa:*.:  than,  e.g.,  randomly  composing  two  operations  until  a  nice 
new  one  is  crk.ated. 

205.  If  concept  F  takes  only  one  argument, 

Then  it  is  not  worthwhile  to  try  to  coalesce  it. 

This  rule  was  of  little  help  epu-timewise.  since  even  if  AM  tried  to  coalesce  such  an  active 
concept, it  would  fail  almost  instantaneously.  The  rule  did  help  make  AM  appear  smarter, 
however. 

Appendix  8.9.  Heuristics  for  dealing  with  the  operation  Canonize 


Canonize  .  Fillin 


206.  If  the  task  is  to  Canonize  predicates  PI  and  P2  (over  AxA)'^  and  the  difference  between  a 
definition  of  PI  and  definition  of  P2  is  jusi  that  P2  performs  some  extra  check  that  PI 
doesn't. 

Then  should  convert  any  af  A  info  s  mem'oer  of  A  which  automatically  satisfies  that  extra 
constraint. 

Thus  when  PI -Same-length,  P2-£quafity,  A-Lists,  the  extra  constraint  that  P2  satisfies  is 
just  that  it  recurs  iu  the  CAR  direction:  the  CARs  of  the  two  arguments  must  also  satisfy 


*  ^  t  g,  Compo$«(Compo««,Compot«)  w  tn  opsrstor  which  Uhtt  3  opsritioM  f,t,h  tnd  forint  ’f  o  g  o  h*j  Lo.,  ihtir  joint 
compotKion 

Thti  it,  fmd  t  function  F  toeh  thtt  Pl<x,y)  iff  P2(F(x),F(y)). 
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P2.  PI  doesn’t  have  such  a  requirement.  The  above  rule  then  has  AM  seek  out  a  way  to 
guarantee  that  the  CARs  will  always  satisfy  Equality.  A  special  hand-crafted  piece  of 
knowledge  tells  AM  that  since  "T-T”  is  an  example  of  Equality,  one  solution  is  for  all  the 
CARS  to  be  the  atom  T.  Then  the  operation  F  must  contain  a  procedure  for  converting 
each  ember  of  a  structure  to  the  atom  T".  Thus  (A  C  {Z  A  B}  would  be  converted 
to  (T  T  T  T  T).  This  rule  Is  a  speclallied,  "compiled"  version  of  the  idea  expressed  in  rule 
number  13. 

207.  If  She  ink  it  to  Ctnoni2«  PI  anv  ''  over  AxA,  trying  to  tyntheilze  F,  where  A«Structure, 

Th^n  perhaps  there  it  t  .ted  type  of  ttruriire  B  which  the  nrguntent  to  F  should 

always  be  converted  inly  that  case,  consider  PI  and  P2  as  two  predicatea  over  BxB. 

This  special-purpose  ru>6  is  Ui."  •  guide  a  series  of  experiments,  to  determine  whether  PI 
is  afttTted  by  adding  multiple  copies  of  existing  elements  into  its  arguments,  and  whether  its 
value  is  affected  by  rearranging  some  of  >'  .rguments’  elements.  In  the  case  of  Pl-Same- 
siie,  the  answers  are:  multiple  elements  do  :  ’  •  a  difference,  but  rearrangement  doesn’t.  So 
the  canonical  type  of  structure  for  F-Size  mu.  be  one  which  is  Mult-eles-allowed  and  also 
one  which  is  Unordered.  Namely,  a  Bag.  Thus  F  is  modified  so  that  it  first  converts  its 
argument  to  a  Bag.  Then  Equality  and  Same-size  are  viewed  as  taking  a  pair  of  Bags,  and 
Size  is  viewed  as  taking  one  Bag  and  giving  back  a  canonical  bag. 

208.  After  F  is  created  from  PI  and  P2,  as  Canonize(Pl,P2). 

an  acceptable  value  for  the  worth  of  F  is  the  maximum  of  the  worths  of  PI  and  P2. 

In  the  actual  Lisp  code,  an  extra  small  term  is  added  which  takes  into  account  the  overall 
value  of  all  the  Canonizations  which  AM  has  recently  produced. 

209.  IF  the  current  task  has  jus*  created  a  canonical  specialization  B  of  concept  A,  with  respect  to 

predicates  PI  and  P2,  [i.e.,  two  members  of  B  satisfy  PI  iff  they  satisfy  P2], 

THEN  add  the  fviiowing  entry  to  the  Analogies  fKet  of  B: 

<A  PI  Operations-on-and-into(A)> 

<B  P2  Those  operations  restricted  to  B's> 

This  rather  incoherent  rule  says  that  it’s  worth  taking  the  trouble  to  study  the  behavior  of 
each  operation  when  it  is  restricted  to  working  on  standard  or  "canonical"  items.  Moreover, 
some  of  the  old  relationships  may  carry  over  -  or  at  least  have  analogues  -  in  this 
restricted  world.  When  numbers  are  discovered  as  canonical  bags,  all  the  bag  operations 
are  restricted  to  work  on  only  canonical  bags,  and  they  magically  turn  into  what  we  know 
and  iove  as  numeric  operations.  Many  of  the  old  bag-theoretic  relationships  have  numeric 
analogues.  Thus  we  knew  that  the  bag-difference  of  x  and  x  was  the  empty  bag;  this  is  still 
true  for  x  a  canonical  bag,  but  we  would  word  it  as  "x  rr.’nus  x  is  zero".  This  is  because  the 
restriction  of  Bag-difference  to  canonical  bags  (bags  of  T’s)  is  precisely  the  operation  we  call 
subtraction. 

210.  When  Canonize  works  on  PI,  P2  (two  predicatsi),  and  produces  an  operation,  F, 

Both  predicatea  mutt  share  a  common  Domain,  of  the  form  AxA  for  tome  concept  A,  and  the 
new  operation  F  can  have  <A  -» A>  h  one  of  its  Domain/range  entries. 

If  a  canonical  specialisation  (tay  'B')  of  A  is  defined,  then  the  domain/range  of  F  can  Ktuslly 
be  tightened  to  <A  -»  B>,  and  it  it  alto  worth  explicitly  storing  the  redundar4  entry  <B 
-»B>. 


K’" ?w' ^ ^ ^Kr  ^  -^.'i  *."  v\vV^ 
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21 1.  In  the  sam«  situation  as  the  last  rule, 

On*  conjecture  is  that  PI  and  P2  ar*  equal,  when  restricted  to  working  on  pairs  of  B's  [i.e., 
pairs  of  Canonical  A's,  A’s  which  are  in  F(A),  rang*  items  for  F,  items  x  which  ar*  the 
image  F(a)  of  some  a<A]. 

After  canonizing  Equal  and  Same-size  into  the  new  operation  Length,  AM  conjectures  that 
two  canonical  bags  are  equal  iff  they  have  the  same  size. 


Canonize  .  Suggest 


212.  When  Canonize  works  on  PI,  P2,  both  predicates  over  Ax  A,  and  produces  an  operation 

F:A-4A, 

It  is  worth  defining  and  studying  the  image  F(A);  i.e.,  the  totality  of  A's  which  are  canonical, 
already  in  standard  form.  When  this  new  concept  Canonical-A  is  defined,  suggest  the 
task  "Fillin  Dom/range  entries  for  Canonicat*A". 

Thus  AM  studied  Canonical-Bags,  which  turned  out  to  be  isomorphic  to  the  natural 
numbers.  What  we’ve  called  'Canonical-A'  in  this  rule,  we’ve  referred  to  simply  as  ‘B’  in 
earlier  Canonizing  rules. 

213.  If  PI  is  a  very  interesting  predicate  over  AxA,  for  some  interesting  concept  A, 

Then  look  over  PI. Spec  for  some  other  predicate  P2  which  is  also  over  AxA,  and  which  has 
some  interesting  properties  PI  lacks.  For  each  such  predicate  P2,  consider  applying 
Canoniz*(Pl,P2). 


214.  After  producing  F  as  Canonize {Pi,P2)  [both  predicates  over  AxA],  and  after  defining 

Canonical-A, 

It  is  worth  restricting  operations  in  In-dom-of(A)  to  Canonical-A.  Some  new  properties  may 
emerge. 

Thus  after  defining  Canonical-Bags,  AM  looked  at  Bags.ln-dom-of.  In  that  list  was  the 
operation  "Bag-union".  So  AM  considered  the  restriction  of  Bag-union  to  Canonical-bags. 
Instead  of  Bag-union  mapping  two  bags  into  a  new  bag,  this  new  operation  took  two 
canonical-bags  and  mapped  them  into  a  new  bag.  AM  later  noticed  that  this  new  bag  was 
itself  always  canonical.  Thus  was  born  the  operation  we  call  "Addition". 

215.  After  Canonical-A  is  produced, 

It  is  marginally  worth  trying  to  restrict  operations  in  In-range-of(A)  to  map  into  Canonical-A. 

This  gives  an  added  boost  to  picking  Union  to  restrict,  since  it  is  in  both  Bags.In-dom-of 
and  Bags.In-ran-of.  This  rule  is  much  harder  to  implement,  since  it  demands  that  the  range 
be  restricted.  There  are  Just  a  few  special-purpose  tricks  AM  knows  to  do  this.  Restricting 
the  domain  is,  by  comparison,  much  cleaner.  AM  simply  creates  a  new  concent  with  the 
same  definition,  but  with  a  more  restricted  domain/range  facet.  For  restricting  the  range, 
AM  must  insert  into  the  definition  a  check  to  ensure  that  the  final  lesult  is  inside  Canonical- 
A,  not  just  in  A.  This  leads  to  a  very  inefficent  definition. 

216.  After  Canonic^  \  is  produced. 

It  is  worthwhile  to  consider  filling  in  examples  (especially  boundary)  of  that  new  concept. 
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This  is  above  and  beyond  the  slight  push  which  rule  12  gives  that  task. 

Appendix  3.10.  Heuristics  for  dealing  with  the  operation  Substitute 

Note  that  substitution  operations  are  produced  via  the  initial  concepts  called  Parallel-replace 
and  Parallel-replace2.  The  following  rules  are  tacked  on  there. 


Parallel-replace  .  Suggest 


217.  If  two  different  veriebies  are  used  to  represent  the  same  entity,^^  then  substitute  one  for 

the  other. 

This  is  very  important  if  the  two  occurrences  are  within  the  tame  entry  on  tome  facet  of  a 
single  concept,  and  less  so  otherwise. 

The  dominant  variable  should  be  the  one  typed  out  previously  to  the  user;  barring  that,  the 
older  usage;  barring  that,  the  one  closest  to  the  letter  "a”  in  the  alphabet. 

This  heuristic  was  used  less  often  -  and  proved  less  impressive  -  than  was  originally 
anticipated  by  the  author.  Since  most  concepts  were  begotten  from  older  ones,  they  always 
assumed  the  same  variable  namings,  hence  there  were  very  few  mismatches.  A  special  test 
was  needed  to  explicitly  check  for  "x«y"  occurring  as  a  conjunct  somewhere,  in  which  case 
we  removed  it  and  y  substituted  for  x  throughout  the  conjunction. 

218.  If  two  •xproasions  (aspacially:  two  conjacturos)  ara  structurally  similar,  and  appaar  to  diffar 

by  a  cartain  substitution. 

Than  if  tha  substitution  is  parmissabla  wa  have  just  arrivad  at  tha  sama  axprassion  in 
various  ways,  and  tag  it  as  such. 

But  if  tha  substitution  is  not  saan  to  ba  tautologous,  than  a  naw  analogy  is  born.  Associata 
tha  constituant  parts  of  both  axprassions.  This  is  mada  intarasting  if  thara  ara  savaral 
concapts  involvad  which  ara  assignad  new  analogues. 

The  similar  statements  of  the  associativity  of  Add  and  Times  led  to  this  rule’s  identifying 
them  as  analogous.  If  AM  had  been  more  sophisticated,  it  might  have  eventually 
formulated  some  abstract  algebra  concepts  like  "semigroup"  from  such  analogies. 

Appendix  8.11.  Heurifitics  for  dealing  with  the  operation  Restrict 


Restrict .  Fillin 


219.  Whan  filling  in  dafinitions  (algorithms)  for  a  rastriction  R  of  tha  active  concept  F, 

One  entry  can  simply  ba  a  call  on  F.Dafn  (F.AIgs). 

Thus  one  definition  of  Addition  will  always  be  as  a  call  on  the  old,  general  operation  ’Bag- 
union.’  Of  course  one  major  reason  for  restricting  the  domain/range  of  an  activity  is  that  it 
can  be  performed  using  a  faster  algorithm!  So  the  above  rule  was  used  frequently  if  not 
dramatically. 


20 

Wh«n  wt  ««y  th«t  x  ind  y  rtpr«tent  th«  Mtn*  tnlHy,  wh*t  rtilly  m«in  i«  that  they  hava  tha  aama  domain  of  idantity 
(a  VxcBaf'.:/  and  thay  ara  aqually  fraa  (thara  it  a  praciaa  tofical  dafmition  of  all  thia,  but  thara  ia  littia 
point  to  praaantinf  it  hara.) 
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220.  Wh«n  creating  a  restriction  R  of  the  active  concept  F, 

Note  that  R.GenI  should  contain  F,  and  that  F.Spec  should  contain  R. 


221.  When  creating  in  a  restriction  R  of  the  active  concept  F,  by  restricting  the  domain  or  range 
to  some  specialization  S  of  its  previous  value  C, 

A  viable  initial  guess  for  R.Worth  is  F.Worth,  augmented  by  the  difference  in  worth  between 
S  and  C.  Hopefully,  S.Worth  is  bigger  than  C.Worth! 


Appendix  3.12.  Heuristics  for  dealing  with  the  operation  Invert 


Invert  .  Fillin 


222.  When  filling  in  definitions  for  an  Inverse  F*’  of  the  active  concept  F, 

One  "Sufficent  Defn"  entry  can  simply  be  a  blind  search  through  the  examples  of  F. 

If  we  already  knew  that  4-416  is  an  example  of  Square,  then  AM  can  use  the  above  rule  to 
quickly  notice  that  Square-lnverse.Dcfn(16,4)  is  true.  This  is  almost  the  "essence"  of 
inverting  an  operation,  of  course. 


Invert ,  Suggest 


223.  After  creating  an  inverted  form  F*'  of  some  operation  F, 

If  the  only  definition  and  algorithm  entries  are  the  "obvious"  inefficient  ones, 

Then  consider  the  task:  "Fill  in  algorithms  for  F*'",  because  the  old  blind  search  is  just  too 
awful  to  tolerate. 


Appendix  3.13.  Heuristics  for  dealing  with  Logical  combinations 

Eventually,  there  may  be  separate  concepts  for  each  kind  of  logical  connective.  For  the 
moment,  all  Boolean  operators  are  lumped  together  here.  Their  definition  Is  too  trivial  for 
a  ‘Fillin’  heuristic  to  be  useful,  and  even  ‘Check’  heuristics  are  almost  pointless. 


Logical-combine  .  Check 


224.  Th«  usor  may  somatimas  indicata  'Conjunction'  whan  ha  raally  maans  'Rapaating'. 
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Logical-combine  .  Suggest 

225.  If  lh«r«  is  somtfhing  inisrtsfing  to  \'»y  shout  ontitios  satisfying  th«  disjunction  (conjunction) 
of  two  concspts*  definitions, 

Than  consider  creating  a  new  concept  defined  as  that  logical  combination  of  the  two  concepts' 
definitions. 


226.  Given  an  implication, 

Try  to  weaken  the  left  side  as  much  as  possible  without  destroying  the  validity  of  the  whole 
implication.  Similarly,  try  to  strengthen  the  right  side  of  the  implication. 


Logical-combine  .  Interest 

227.  A  disjunction  (conjunction)  is  interesting  if  there  is  a  conjecture  which  is  very  interesting  yet 

which  cannot  be  made  about  any  one  disjunct  (conjunct). 

In  other  words,  their  logical  combination  implies  more  than  any  consituent. 

228.  An  implication  is  interesting  if  the  right  side  is  more  interesting  than  the  left  side. 


229.  An  implication  is  interesting  if  the  right  side  is  interesting  yet  unexpected  based  only  on 
assuming  the  left  side. 


Appendix  3.14.  Heuristics  for  dealing  with  Structures 


Structure  .  Fillin 

230.  To  fill  in  examples  of  a  Kind  of  structure  $, 

Start  with  an  empty  S,  pluck  any  other  member  of  Examples(Structure),  and  transfer 
members  one  at  a  time  from  the  random  structure  into  the  embryonic  S.  Finally,  check 
that  the  resultant  $  really  satisfies  $.0efn. 

This  is  useful,  e.g.,  to  convert  examples  of  lists  into  examples  of  sets. 

231.  To  fill  in  specializations  of  a  kind  of  structure, 

add  a  new  constraint  that  each  member  must  satisfy,  or  a  constraint  on  all  pairs  of  members, 
or  a  constraint  on  all  pairs  of  distinct  members,  or  a  constraint  which  the  structure  as  a 
whole  must  satisfy.  Such  a  constraint  is  often  merely  a  stipulation  of  being  an  example 
of  an  X,  for  some  interesting  concept  X. 

Thus  AM  might  specialize  Bags  into  Bags-of-primes,  or  into  Bags-of-distinct-primes,  or  into 

Bags-containing-a-prime. 


App«ndix  3 


AM  Oitcoviry  in  Mtthtmttict  a*  Haurialic  Starch 


-27A- 


Structure  .  Interest 

232.  Structur*  S  is  mildly  intsrsstins  if  all  mambtrs  of  S  satisfy  tho  iniorosting  predicate  P,  or 

(equivalently)  if  they  are  all  accidentally  examples  of  the  interesting  concept  C,  or 
(similarly)  if  all  pairs  of  members  of  S  satisfy  the  interesting  binary  predicate  P,  etc. 
Also:  a  KINO  of  structure  is  interesting  if  it  appears  that  each  instance  of  such  a  structure 
satisfies  the  above  condition  (for  a  fixed  P  or  C). 

Thus  a  singleton  is  interesting  because  all  pairs  of  members  satisfy  Equal.  The  concept 
"Singletons"  is  interesting  because  each  singleton  is  mildly  interesting  in  just  that  same  way. 
Similarly,  AM  defines  the  concept  of  a  bag  containing  only  primes,  because  the  above  rule 
says  it  might  be  interesting. 

233.  A  structure  is  mildly  intarosting  if  on#  mambar  is  vary  intarasting.  Evan  battar:  exactly  one 

member. 

Also:  a  KIND  of  structure  is  interesting  if  each  instance  satisfies  the  above  condition  in  the 

same  way. 

Thus  the  values  of  ADD*'  are  interesting  because  they  always  contain  precisely  one  bag 
which  is  a  Singleton. 

Appendix  3.15.  Heuristics  for_  dealing  with  Ordered-structures 


Ordered-struc  .  Fillin 


234.  To  fill  in  some  new  examples  of  the  ordered  structure  S,  when  some  already  exist, 
Pick  an  existing  example  and  randomly  permute  its  members. 


235.  To  fill  in  specializations  of  a  kind  of  ordered  structure, 

add  a  new  constraint  that  each  pair  of  adjacent  members  must  satisfy,  or  a  constraint  on  all 
ordered  pairs  of  members  in  the  order  they  appear  in  the  structure.  Such  a  constraint 
is  often  merely  a  stipulation  of  being  an  example  of  an  X,  for  some  interesting  concept 
X. 

Thus  Lists-of-numbers  might  be  specialized  into  Sorted-lists-of-numbers,  assuming  AM  has 
discovered  "s"  and  assuming  it  Is  chosen  as  the  'constraint’  relationship  between  adjacent 
members  of  the  list. 


Ordered-struc  .  Check 


236.  If  the  structure  is  to  be  accessed  sequentially  until  some  condition  is  met,  and  if  the  precise 
ordering  is  superfluous. 

Then  keep  the  structure  ordered  by  frequency  of  use,  the  most  useful  element  first. 

This  is  a  simple  data-structure  management  trick.  If  you  have  several  rules  to  use  in  a 
certain  situation,  and  rule  R  is  one  which  usually  succeeds,  then  put  R  first  in  the  list  of 
rules  to  try.  Similarly,  in  a  pattern-matcher,  try  first  the  test  most  likely  to  detect  non¬ 
matching  arguments. 
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237.  If  tiructur*  $  it  tlwtyt  to  bo  mtinteinsd  in  tiphinumoric  ordor, 

Thon  AM  ctn^'  Kiuiliy  mtintain  it  at  an  unordorod  tiruciuro,  if  doairad. 

Luckily  this  heavily  implementation-dependent  rule  was  never  needed  by  AM. 


Ordered-struc  .  Interest 

238.  An  ordorod  atructuro  S  ia  intoroating  if  OKh  adjKont  pair  of  mombora  of  S  aatiafioa 
prodicato  P  (for  tomo  rare,  intoroating  P). 

When  AM  discovers  the  relation  it  immediately  thinks  that  any  mttd  list  of  numbers  is 
more  interesting,  due  to  the  above  rule. 

Appendix  S.i6.  Heuristics  for  dealing  with  Unordered-structures 


Unord-struc  .  Check 

239.  To  chock  an  oxampio  of  an  unordorod  atructuro, 

Enturo  that  it  ia  in  alphsnumorically*sortod  ordor.  ■<  nut,  thon  SORT  it. 

Ail  unordered  objects  are  maintained  in  lexicographic  order,  so  that  two  of  them  can  be 
tested  for  equality  using  the  LISP  function  EQJLJAL.  Because  of  this  convention,  any  two 
structures  can  therefore  be  tested  for  equality  using  this  fast  list-structure  comparator. 

Appendix  8.17.  Heuristics  for  dealing  with  Multiple-eles-structures 


Mult-ele-struc  .  Fillin 


240.  To  fill  in  aotno  now  oxamploa  of  tho  atructuro  $,  whoro  S  ia  a  atructuro  admitting  multipio 
occurroncoa  of  tho  aamo  olomont,  whon  aomo  oxamploa  alroady  oxiat. 

Pick  an  oxiating  oxampio  and  randomly  chango  tho  multiplicity  with  which  variout  mombora 
occur  witMn  tho  atructuro. 


Appendix  8.18.  Heuri8tic8_£or_ dealing  with_Set8 
Sets  .  Suggest 

241.  If  P  ia  a  vary  intoroating  prodicato  ovor  X, 

Thon  atudy  thoao  two  aotat  tho  mombo!  a  of  X  for  which  P  holda,  and  tho  Onoa  for  which  P 
faila. 


21 


Aw  to  Iha  current  IBP  iwplciiwntition  cf  Ait«-*truetur«* 
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Whlle  we  humans  know  that  this  partitions  X  into  two  pieces,  AM  is  never  explicitly  told 
this.  It  would  occasionally  be  surprised  to  discover  that  the  union  of  two  such  complements 
"accidentally"  coincided  with  X.  Incidentally,  this  rule  is  really  the  key  linkage  between 
predicates  and  sets;  it  is  close  to  the  entry  on  SetView  which  tells  how  to  view  any  predicate 
as  a  set. 


Sets  .  Interest 

242.  A  set  S  is  interesting  if,  for  some  interesting  predicate  P,  whose  domain  is  X, 

S  ecci^ntsliy  appears  to  be  reiated  strongiy  to  {x<X  |  P(x)},  i.e.,  to  those  members  of  X 
satisfying  P. 

To  the  surprise  of  the  author,  this  rule  never  found  application  in  any  of  AM's  runs. 
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Appendix  4.  Maxiraally-Divisible  Numbers 


The  honor  of  your  machine  is  preserved. 

Erdot* 


This  appendix  discusses  a  discovery  motivated  by  AM:  a  new  little  bit  of  mathematics  that 
was  discovered.  It  is  presented  as  if  it  were  a  math  Journal  article,  in  a  fairly  formal  way, 
almost  unmotivated. 

After  the  concept  was  discovered,  the  author  learned  that  Ramanujan,  a  self-taught  Indian 
mathematician,  had  worked  on  similar  issues  early  in  this  century.  The  final  subsection 
contains  a  relaxed  summary  of  what  AM  did,  what  the  author  did,  and  what  Ramanujan 
did. 


Appendix  4.1.  A  Meaningful  Question 

We  begin  by  asking  the  question,  "What  is  the  converse  concept  to  prime  numbers?"  If  we 
define  ^primeness"  to  mean  that  a  natural  number  has  as  few  divisors  as  possible  (tiamely, 
Just  two  of  them:  1  and  itself),  then  the  converse  kind  of  number  would  be  one  which  had 
an  abnormally  iarge  number  of  divisors. 

One  could  consider  the  following  set  M  of  maximally-divisible  numbers: 

M  -  {x<N"  I  (Vy<x)  (  d(y)  <  d(x) ) } 


where  d(n)  is  the  number  of  divisors  of  n,^  is  the  set  of  positive  integers,  and  the  vertical 
bar,  V  is  read  'such  that’. 

In  words,  this  says  that  M  is  the  set  of  alt  positive  integers  satisfying  the  property  that  every 
smaller  number  has  fewer  divisors.  That  is,  we  throw  into  the  set  M  a  number  x  if  (and 
only  if)  it  has  more  divisors  than  any  smaller  number.  So  I  gets  thrown  in  (the  smallest 
number  with  I  divisor),  2  (having  2  divisors),  4  (3  divisors,  namely  1,  2,  and  4),  6  (4 
divisors),  12  (6  divisors),  etc.  Another  way  to  specify  M  is  as  the  set  containing  (for  alt  n) 
the  smallest  number  having  at  least  n  divisors.  Notice  that  we  are  not  going  to  include  "the 
smallest  number  with  precisely  5  divisors",  since  this  number  (which  happens  to  be  2^  or  16) 
is  bigger  than  12  (which  has  6  divisors).  So  no  number  in  M  has  precisely  five  divisors. 


*  Rtmtrktd  by  Pt«H  Erdot,  afttr  •xtmimnc  nim  of  Ihio  oppondht’o  mitoriol 
2E.|,d(l2).l|'’,2AA6,12)ll-6. 
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One  of  the  questions  at  the  heart  of  our  study  is  "Given  d,  what  is  the  smallest  number 
with  at  least  d  divisors?" 

How  can  we  even  start  on  this  question?  The  most  powerful  tool  we  have  is  the  following 
combinatorially-proved  theorem: 

(Tl)  If  we  write  n  as  2*'S**  "pj^**‘,  then  d(n)  -  (a|^lXa2*l)4a4,^I)- 

Our  central  question  could  be  answered  if  we  could  somehow  invert  this  formula  into  one 
which  expressed  n  as  a  function  of  d,  and  then  found  the  minima  of  that  function  n(d). 
Coupled  with  the  knowledge  that  each  number  can  be  factored  uniquely  into  prime  factors, 
TI  provides  a  closed-form  way  of  manipulating  d(n).  That  is,  n  is  really  a  function  of  the 

sequence  of  exponents  when  written  as  2*‘s**~^  so  we  can  consider  n  -  n(a|,  ag,...).  Then 
Tl  is  really  a  way  of  expreuing  d(n)  -  d(a|,  aor-.)- 
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Appendix  4,2.  Special  Case;  n  ■  2^3^ 

Let’s  consider  a  special  case.  We'll  restrict  our  attention  to  numbers  n  which  are  of  the  form 

2^3^.  So  Tl  says  that  d(n)-(a4lXb4l).  Consider  fixing  d,  and  asking  how  n  varies  with  a 
and  b.  Notice  that  we  are  now  saying  that  (a4lXb4l)-d-constant.  So  we  can  say  that 
(b4l)-d/(a4l),  so  b-(d/(a4l))-l.  So  our  number  n  is  really  Aha!  This  is  an 

expression  for  n  simply  as  a  function  of  a.  We  can  use  calculus  to  find  the  minima  of  this 
function.  That  will  correspond  to  the  optimal  exponent  a,  from  which  we  can  compute  the 
optimal  b. 

dnfd*  -  2«(3*»(-d/(a4l)2)lof(3))  4  3W/(«*1))"1(2«Io|(2)) 

>[(a*l)log(2)  -  (b*l)log(3)](n/(i4l)). 


This  is  zero  when  (a4|)log(2)  -  (b4l)log(3). 

So  we  have  two  equations  now: 

(•4l)  ■  (b4l)log(3)/log(2) 

(a4l)>d/(b4l) 

Together  they  say  that  (b4l)log(3)/log(2)  -  d/(b4l),  from  which  we  derive  (b4l)2  « 
log<2)d/log<3).  Substituting  this  back  in,  we  also  get  that  (a4l}^  >  log(3Xl/log(2). 

If  real-valued  exponents  were  allowed,  our  optimal  n(d)  would  be; 

2SQRTItftef (3)  /  loi<2)J  ,  3SQRTI<rlof<2)  /  lot(3)) 

Three  observations  we  can  make  from  intuition  -  and  Justify  from  reality  -  are  (i)  this 
optimal  real  value  is  better  than  (i.e.,  S)  any  integral  n  (divisible  only  by  2  and  3)  with  at 
least  d  divisors,  (ii)  the  ideal  n  is  very  close  to  the  best  such  integral  n.  Oil)  the  best  such 
integral  n  will  have  exponents  a  and  b  which  are  close  to  our  theoretical  real-valued  "ideal" 
a  and  b. 

For  example,  if  we  choose  to  ask  for  a  number  with  at  least  8  divisors,  our  theoretical 
values  for  a  and  b  are  about  2.6  and  i.2;  the  ideal  n  is  then  about  23.  In  actuality,  the  first 
number  with  8  or  more  divisors  is  24,  and  it  is  factored  into  2®3‘  (i.e.,  the  best  integral 
values  for  a  and  b  are  3  and  1,  respectively). 


Let’s  consider  a  second  special  case.  We'll  restrict  our  attention  to  numbers  n  which  are  of 

the  form  So  Tl  says  that  d(n)-(a«lXb+lXc«l)>  Consider  fixing  d,  and  asking  how  n 

varies  with  a,  b,  and  c.  Notice  that  we  are  now  saying  that  (a^lXb^lXc'^O-d-constant.  So 
we  can  say  that  (cfl)-d/(a«lXb«l),  so  c-(d/(a«iXb«i))-i.  So  our  number  n  is  really 
2<3b5((d/(t*ixiHi»-i] 

Viewing  c  as  a  function  of  a  and  b,  we  can  compute  the  differential 
dc  -  d(d/(a^lXb«I)) 

-  d«W(l/(a+lXb+l))] 

-  d«[(l/(a*l>X-l/(b+l)2)db  ♦  (l/(b*l)X*l/(a*l)2)</a] 

-  (.(c*l)/(b+l))db  ♦  (-(c*l)/(a*l))da 


We  want  to  minimize  this  function  n-n(a.b).  It  will  turn  out  to  be  easier  to  find  the  minima 
of  log(n),  viewed  as  a  function  of  a  and  b.  The  minima  of  n  will  occur  precisely  at  the 
minima  of  log<n).  So  to  find  the  solutions  to  dn  •  0,  we  Just  find  the  solutions  to  dLOGn  •  0. 
Now  log<n)  -  lo^2)a  *  log(3)b  ♦  iog(5)c.  So  the  differential  dLOGn  •>  log(2)da  ♦  log(3)db  * 
log(5)dc.  Substituting  in  the  value  we  obtained  for  dc,  we  get 

dLOGn  •  lof(2)da  «  lo|(3)db « lo|(S}[(-(c»i)/(b*l}}db  *  (-(e*l)/(a«li)da] 

•  [log(2)-(cM)log(5)/(a*l)]de  *  [leg(3)-(c*l)log(5)/(b*l)]db 


One  nice  way  to  make  this  identically  zero  is  if  the  coefficients  of  da  and  db  become  zero. 
That  is,  n  will  have  minima  when  both  log(2)  -  (c*i)log(5)/(a«l)  and  log<3)  - 
(c*l)log(5)/(b+l)  are  true.  That  is,  when  (a+l)log(2)  -  (b+l)log<3)  ■  (c+i)log(5). 

This  is  a  generalization  of  the  earlier  result  that  minima  occur  when  (a4^t}log<2)  * 
(b+i)log(3}.  We  can  easily  see  that  the  general  pattern  of  the  constraints  are:  (a|«l)/(aj4l)  - 

log<pj)/log<pj). 

What  are  the  explicit  formulae  for  the  exponents  in  the  k-3  case?  We  can  solve  for  them 
in  terms  of  d  by  using  Tl.  Namely, 

(•*l)(b*l)(c*i)  a  d 

(•♦l)a(flXog(5)/log(2) 

(b*l)  ■«  (c«l)log(5)/log(3) 

Substituting  the  last  two  equations  into  the  first,  we  get  (c^l)^  (log(5))^  •  d  log(2)log(3). 
Hence  c+l  -  CUBEROOTCd  log(2)  log(S)  /  log2(5)]. 
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For  reasons  of  symmetry,  we  will  transform  this  siightiy  into 
t*l  >  CUBEROQT[  d  tof  (2)  logO)  log(5)  ]  /  log(5) 

The  nicely  symmetric  equations  for  a+I  and  b+1  turn  out  to  be; 

■*1  -  CUBEROOTt  d  log(2)  log(3)  log(5)  ]  /  log(2) 
b*l  -  CUBEROOn  d  log(2)  log(3)  log(5}  ]  /  iog(3) 

Viewed  in  this  way,  we  can  rewrite  our  equation  from  the  k-2  case  into  the  same  kind  of 
expression,  namely: 

•*I  >  SQUAREROOT[  d  log(2)  log(3)  ]  /  log(2) 
b«l  •  SQUAREROOT[  d  log(2)  log(3}  ]  /  log(3) 

Again  the  general  pattern  seems  to  be  evident: 

•j*!  -  K<>’R00T[  d  iog(2)  log(3)...log(pn)  ]  /  log(p|) 

As  in  the  k>2  case,  the  equations  for  a,b,c  have  real  correspondence  to  the  optimal  integral 
values  of  the  exponents. 

Appendix  4.4.  The  General  Case 

We  are  now  ready  to  consider  the  most  general  case,  namely  when  n  -  2®'3®^"  p)^^.  By  T 1, 
we  know  that  d(n)  -  (aj+lXa2+l)...(a^*l).  One  generalization  of  our  earlier  work  would 
indicate  that  minima  of  n  (for  a  given  value  of  d)  occur  whenever 

{T2)  [for  all  i  and  J  between  1  and  k](aj+l)/(aj*I)  -  log(pj)/log(pj). 

This  is  really  a  set  of  k-1  equations  in  the  k  different  variables  a|,...,a]^.  Using  the  formula 
for  d  which  T1  provides,  we  can  solve  this  system  of  equations  for  each  a^  in  terms  only  of 
d.  The  resulting  formulae  are: 

(T3)  [Vi£k]  aj4l  .  d  log(2)  log(3)Jog(pj^)  3  /  log(pj) 

The  deriviation  of  T3  from  T2  is  straightforward.  Below  is  the  proof  of  T2,  due  to  Knuth. 
He  uses  Lagrange  multipliers. 

The  task  is  to  minimize  n,  for  a  given  d.  It  suffices  to  find  the  minima  of  log(n).  Thus 
we  wish  to  minimize  ajlog(pjK.+aj^log(pi^),  for  a  given  value  of  d«(a|4l)*...»(aj^4l). 
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But  this  tatter  constraint  means  that  A*[(a|sl)...(a|(+1)  -  d]  is  identically  zero  for  any 
value  of  X  Thus  we  may  view  our  problem  as  the  minimization  of 
a|log<pi)s...sakiog<Pl()  *  A'Kai^D^ak^l)  -  d].  For  any  i,  the  partial  derivative  of  this 
with  respect  to  aj  is  log(p|)  *  X*[(a|4|)...(ak«i)]/(aisl}.  At  an  extremum,  all  such  partial 
derivatives  vanish.  That  is,  for  any  i,  log<pj)  *  X*[(a]sI)...(ak«I)]/(a|si)  «  0.  This  says 
that  (aj4l)i(^pj)  -  -A*[(a|4l)...(ak4l)l  So,  for  any  i,  (aj4l)iog<p|)  -  -x^d.  Since  A  and  d 
are  independent  of  i,  this  proves  T2. 

Now  that  we  know  T2  and  T3  to  be  true,  we  can  actually  compute  the  optimal^  values  for 
n.  It  will  simplify  matters  again  to  consider  only  log(n)  for  the  moment,  [note:  SIGMAj(...) 

means  "the  sum,  from  !■!  to  l»k,  of ..."]  Now 


log(n)  -  ailog(2)  4  a2log<3)  4,..4  aklog(pk) 


-  SIGMAi[log(pj)  aj] 

-  SIGMAi[log(piX(K‘hROOT[  d  log(2)  log(S)...log{pk)  l/logfpj))  •  1)] 

-  SIGMAi[K*hROOT(  d  log(2>  log<S)„.log(pk) )  •  log<pj)] 

-  ktKthROOT(  d  log(2)  log(3)...log(pk))]  •  SIGMAj(log(pi)] 

This  then  gives  the  nice  result: 

W  n  .  {.[“  X»W)OTW)  K"H!OOT(lo,e)lo,(3),J.|(|.,))l)  /  |„(3)j.,(p,)) 

If  we  let  Fk  represent  the  product  of  the  first  k  primes,  then  this  says 
n  .  {e[k  K‘hROOT(d)  K‘hROOT(Fk]}  / 


If  we  let  Gk  be  gtk  K^^ROOT(log(2)...l<^Pk))].  then  this  becomes 


■  -JV-  rw  A.  -V  %4  %  i  i  *•  I  ^ 
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(T4)  „  .  WK'I’ROOW)])  /  F,. 


So  by  tabulating  Gj^  and  F)^,  we  can  efficiently  compute  the  ideal  value  for  n  (and  for  each 
aj)  given  a  desired  d  and  allowable  k. 

Notice  that  if  we  are  allowed  more  and  more  distinct  prime  factors,  that  is,  as  k  grows,  the 
real-valued  exponents  ai  get  smaller  and  smaller,  until  finally  they  become  negative,  and  we 

have  broken  all  ties  to  reality.  Empirically,  the  ideal  value  seems  to  be-obtained  when  no 
exponent  is  allowed  to  be  below  0.5;  in  those  cases,  the  ideal  real  value  for  n  is  both  close  to, 
and  slightly  lower  than,  any  intergral  solution. 

Of  course  this  is  not  satisfactory;  what  we  now  need  is  a  formula  which  tells  us,  for  a  given 
d,  how  many  distinct  prime  factors  any  n  must  have,  in  order  to  have  d  divisors.  That  is, 
we  would  like  to  know  k  as  a  function  of  d,  or  k  as  a  function  of  n.  Luckily,  k(n)  is  a  very 
slowly  changing  function. 

For  the  numbers  of  form  n»2*3*5». , .  'pi^,  we  can  see  from  T1  that  d-2^.  For  maximally- 

divisibles,  it  seems  likely  that  d  will  in  general  be  larger;  say  it  is  of  the  form  d-0^  (where  (i 
is  trivially  seen  to  be  22).  :  Then  we  can  plug  this  into  (e): 

n  -  e{t«og(d)/log(a)lI  K‘hROOT(log2  log3...log  p^]}  /  2.3.6...p,^ 

But  the  geometric  mean  is  roughly  log(p](),  which  is  about  iog(log(d)).  Also,  the  product  of 

the  first  k  primes  is  roughly  k^,  which  is  about  [iog(d)/log(|3)][logiog(d)-logiog(|3)].  Putting 
these  into  the  last  equation,  we  get: 

n  ’>  e{tlog(d)/log((i)XfJ-i)iog(log(d))} 
n  ^  c[Wog<d)(0-O/ iog(fJ)} 

If  the  best  we  can  do  is  the  trivial  result  that  fI22,  then  we  obtain  the  already-known 
relation^  that 

n  >  f 

If  we  can  show  that  k  is  at  least  3,  then  these  n’s  Jump  to  the  squares  of  their  former  values. 


a 


Thi*  M  in  f«et  Dm  fhtrpttt  bound  hithorto  known  for  n<d).  H  wm  proviouoly  dorivod  much  moro  forivouoly,  uoing  nMthodo 
not  rolttod  to  Iht  ctleuluf. 
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This  would  be  a  much  better  bound,  of  course.  In  general,  the  sharpest  bound  will  be 
found  by  determining  (i  sharply. 

Appendix  4.5.  An  even  Stronger  claim 

A  very  constructive  answer  to  this  whole  development  could  be  provided  if  the  following 
were  true: 

(T5)  The  set  M  of  maximally  divisible  numbers  coincides  precisely  with  the  set  of  integers 
obtained  in  the  following  manner: 

(1 )  For  oach  natural  number  d,  um  T3  to  compute  the  optimal  exponenta  for 
n(d,K),  with  k  at  large  ac  possible  such  that  no  a|  Is  below  0.5 

(2)  Round  each  exponent  to  the  nearest  integer,  and  compute  the  corresponding  n. 

Add  this  n  to  the  set. 


There  is  probably  a  nice  closed-form  formula  for  such  numbers,  a  sort  of  "compiled" 
version  of  T3  and  T5.  This  is  then  the  desired  characterization  of  M.  Exhaustive  search 
has  in  fact  confirmed  T5  for  all  d  below  1500.^  T5  has  the  advantage  of  being  intuitively 
clear.  Perhaps  its  proof  will  turn  out  to  be  nontrivial  or  nonexistent.  I  leave  it  as  "AM’s 
conjecture".  This  is  so  far  the  only  piece  of  interesting  mathematics,  previously  unknown, 
that  was  directly  motivated  by  AM. 

For  example:  consider  d»1344.  The  largest  that  k  can  be  without  1'3  calling  for  <  0.5  is 
k-7.  For  this  d  and  k,  T3  predicts  exponenu  5.9,  3.3,  2.0,  1.4,  1.0,  0.9,  and  0.7.  Rounding 

this  off,  we  get  6,  8.  2,  1,  1.  1.  1.  Next  we  compute  2^33527^1*  13*17*.  This  is  735,134,400. 
T1  tells  us  that  this  has  in  fact  precisely  1344  divisors  (coincidence).  Exhaustive  search  tells 
us  that  no  smaller  n  has  that  many  divisors  (this  is  a  verification  of  T5).  Incidentally,  the 
ideal  real  value  for  n  (for  this  k  and  d  value,  using  («))  is  603,696,064.  Notice  that  it  is  close 
to,  and  less  than,  the  best  possible  integral  n  with  1344  divisors. 

Another  such  "rounded-exponent"  number  is 

n-2*3V72il2ls‘l7‘l9‘23‘29'3l'37'4l'43'47'53'.  The  progression  of  its  exponents*!  (9  6 
4332222222222  2)  is  about  as  close  as  one  can  get  to  satisfying  the  "logarithm" 
constraint.  By  that  I  mean  that  9/6  is  close  to  log(9)/log(2);  that  2/2  is  close  to 
log(47)/log(43},  etc.  Changing  any  exponent  by  plus  or  minus  1  would  make  those  ratios 
worse  than  they  are.  This  number  n  is  about  25  billion,  and  has  about  4  million  divisors. 
The  AM  conjecture  is  that  there  is  no  smaller  number  with  that  many  divisors. 

Incidentally,  the  average  number  in  the  neighborhood  of  n  has  about  2*°?*°?  ”  divisors 
(about  9  divisors,  for  numbers  near  this  n). 


^  Th*  Oftty  f*uft  it  th«t  th*  number  4  w  in  M,  y«<  i*n’t  found  by  (hi*  proeodurt.  Thi*  mty  b«  duo  to  orroro  occurring  near 

tmaM  Moforo,  or  H  may  portend  the  eccaaionai  (parhapa  mfinitaly  often)  faHura  of  thia  procedure  T5. 
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Aopendix  4.6.  AM  and  Ramanujan 


/  then  adopt  a  different  point  of  view  (from  Dirichlet,  Wigert,  and  other 
mathematicians  who  have  studied  din}}.  /  define  a  highly  composite  number  as  a 
number  whose  number  of  divisors  exceeds  that  of  all  its  predecessors. 

*■  Ramanujan 


What  AM  did;  A  M  defined  the  set  M,  of  maximally-divislble  numbers.  It  thought  the  set 
might  prove  interesting.  AM  found  several  members  of  M.  It  had  recently  learned  about 

unique  factorization,  so  it  factored  each  member:  each  number  n-2^'...p|''  was  replaced  by 
the  sequence  <a|,...,a|^>.  While  factored  in  this  form,  a  rough  kind  of  regularity  was 

noticed.  AM  didn’t  have  the  concepts  of  logarithms,  exponentiation,  e,  analyticity,  reals, 
continuity,  etc.,  so  it  couldn’t  possibly  carry  this  work  much  further. 

What  the  author  did  (aided  and  abetted  by  Randall  Davis):  Noticing  the  general  pattern  in 
these  sequences*,  the  author  developed  the  results  which  were  described  in  the  past  few 
subsections.  The  major  results  are  as  follows: 

1.  The  smallest  possible  number  n  with  d  or  more  divisors  (where  n  is  of  the  form 

n-2*'...pj[^)  is  e'‘’'^*'’RWd*lo*2*loi3*...«lo|(pK)}/2.3.„..p^'  This  is  a  real 

number,  which  is  a  good  lower  bound  on  n(d)  (the  smallest  n  with  d  or  more 
divisors).  If  k  is  approximable  as  log(d)/log(0),  for  some  (3  (we  know  0  is  bigger 
than  2),  then  the  preceding  formula  can  be  simplified  into: 

The  higher  one  can  prove  0  (>2)  is,  the  better  this  result. 

2.  For  such  "idealized"  real  values  of  n(d),  the  exponents  aj  of  the  prime  factors  of  n 

can  be  computed  by  the  formulae:  ij*!  ■  K^^ROOT{d«lo|(2)«io{(3)*...*ios(p|^})/los(Pj). 
These  exponenu  satisfy  the  well-known  relationship  that  the  product  of  the  (aj^^O’s 
is  equal  to  d.  They  also  satisfy  the  lesser-known^  relation  that  (aj4l)*iog(pj)  is  a 
constant  (the  same  for  all  values  of  the  index  i). 


*  Namely,  they  Mcmed  (e  bt  detiribifale  tt:  <bi|  no.,  medium  no.,  medum-tmen-no,-,  2,  2,  1,  tr-,  t> 
^  I  Ibeuiht  tbie  wee  ’unknown”,  but  leter  found  that  Ramanujan  had  found  a  vary  aimtlar  ralationahip. 
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3.  The  elements  of  M  appear  to  be  those  same  numbers,  but  with  the  above  real- 
valued  exponents  (the  aj's)  rounded  to  the  nearest  integer. 

What  Ramanujan  didt  Very  recently,  the  author  was  directed  to  the  work  of  Srinivasa 
Ramanujan  Aiyangai  This  Indian  mathematician  was  essentially  self-taught.  Receiving 
little  formal  education,  he  had  almost  no  contact  with  Western  number  theory.  Yet  he 
became  interested  in  number  theoretic  issues,  and  re-derived  much  of  that  field  all  by 
himself.  In  that  way,  he  is  perhaps  the  closest  human  analogue  to  AM:  he  had  ability, 
techniques,  background  knowledge,  and  he  was  left  to  explore  and  redevelop  much 
elementary  mathematics  on  his  own.  Let  me  quote  from  G.  H.  Hardy’s  final*  sketch  of  this 
genius: 


"The  limitations  of  his  knowledge  were  as  startling  as  its  profundity...  Here 
was  a  man  who...had  found  the  dominant  terms  of  many  of  the  most  famous 
problems  in  the  analytic  theory  of  numbers,  and  yet...his  ideas  of  mathematical 
proof  were  of  the  most  shadowy  description.  All  his  results,  new  or  old,  right  or 
wrong,  had  been  arrived  at  by..intuition  and  induction  from  numerical 
examples." 

It  was  exciting  to  learn  that  Ramanujan  had  also  defined  the  very  same  set  M,  which  he 
called  highly-composite  numbers.  His  great  interest  in  them  has  been  almost  unique*  within 
mathematics  circles  -  until  AM  was  M  to  consider  them.  In  an  article  published  in  1915, 
Ramanujan  derives  the  relationship:  aj*log(pj)-const,  which  he  says  holds  approximately, 

for  values  of  i  which  are  much  smaller  than  k.  He  establishes  this  using  inequalities  (and 
using  the  distribution  of  prime  numbers  n(x)).  Thus  it  has  a  different  flavor  from  the 
similar  relationship  derived  using  calculus  (#2  above,  and  also  found  as  statement  T2  a 
couple  pages  ugo).  Also,  Ramanujan  at  this  point  went  off  on  a  different  path,  and  missed 
the  other  two  results  listed  above  (•!  and  tS).  Instead,  he  defined  a  specialization  of  this 
concept,  which  he  called  'superior  highly-composite  numbers’,  and  investigated  them  <n 
detail.  Neither  AM  nor  the  author  had  the  s^histication  to  make  that  definition  and  to 
find  the  results  which  Ramanujan  subsequently  uncovered  about  superior  highly-composite 
numbers. 


*  Taken  from  Ramanujan’s  obituary  notica,  IS2I.  Found  in  tha  prafKa  to  [Rairanujan  27]. 
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Appendix  5.  Traces  of  AM  in  Action 


Thers  are  three  types  of  traces  which  are  represented  in  this  appendix.  First  comes  a  high- 
level  prose  desription,  a  comtnentary  on  AM  as  it  goes  through  a  long,  successful  run.  This 
is  an  expanded  version  of  the  historian's  descr-ption  of  AM  as  a  mathematician,  found  in 
Section  1.3,  on  page  10. 

Next  comes  a  detailed  description  of  what  AM  did,  on  a  numbered  task-by-task  basis.  This 
is  an  expanded  version  of  the  task-by-task  trace  given  in  Section  6.1,  on  page  115.  For  each 
task,  a  paragraph  is  provided  explaining  v  hat  AM  did,  why,  and  what  happened.  These 
task  summaries  start  on  page  294.  The  task  numbers  listed  there  correspond  to  the 
numbering  in  Section  6.1. 

Finally,  several  pages  of  traces  are  presented  in  totally  undoctored  form,  so  the  reader  can 
see  that  (i)  it  is  harder  to  follow  than  the  slightly  rephrased  ones,  and  yet  (ii)  those  earlier, 
"doctored"  traces  didn’t  enhance  (or  alter  the  the  spirit  of)  what  AM  printed  out. 

Appendix  5.1.  Pfose  Traces 

In  this  section  are  sketched  many  of  the  paths  which  AM  explored  during  its  runs.  Besides 
describing  what  AM  did,  this  section  will  explain  why,  and  indicate  where  each  path  led. 
Along  the  way,  some  concepts  were  created  which  were  interesting  to  us  (in  the  smug 
wisdom  of  millenia  of  hindsight)  but  which  AM  never  bothered  to  develop.  These  will  be 
noted,  and  a  stab  will  be  made  to  apologize  for  AM*.  A  few  exciting  moments  occurred 
when  AM  became  Interested  in  a  concept  which  had  been  ignored  by  humans  -  at  least, 
unknown  to  the  author.  Very  often  the  "new  discovery"  was  never  shown  to  be  anything 
other  than  cute  (e.g.,  the  concept  of  Timberline;  see  page  133  for  a  definition  and  diagram 
of  it). 

AM  began  by  exploring  structures  and  structural  operations.  After  it  was  started,  with  the 
base  of  knowledge  outlined  in  the  previous  chapter,  the  main  activity  AM  did  for  the  first 
several  minutes  was  to  fill  in  examples  of  various  kinds  of  structures  (e.g..  Sets),  structural 
operations  (e.g..  Insert),  and  create  new  concepts  of  this  type  (e.g.,  Singleton).  In  more  detail, 
here  is  what  was  done: 

Trying  to  fill  in  examples  of  set-operations,  AM  kept  failing  because  there  were  no  known 
examples  of  Sets  lo  "run"  those  operations  on.  So  it  turned  to  filling  in  examples  of  Sets. 
Some  of  these  came  from  the  recursive  definition  of  a  set:  "S  is  a  set  if  S-{}  or  if  both  (i)  we 
can  pull  an  element  y  out  of  S  using  Some-member,  and  (ii)  &et-delete(y,S)  is  a  stt".  A 
heuristic  rule  knew  to  extr,ict  the  base  case  from  such  a  definition,  yielding  {}  as  one 
example  of  a  set.  Another  heuristic  said  to  run  operations  whose  range  is  ‘Sets'.  One  of 
these  is  Set-insert.  So  a  procedure  for  getting  a  new  set  is  to  take  the  given  set  S,  and 
anything  y,  and  run  Set-insert(y,S).  When  this  was  done,  using  $■{},  a  bunch  of  singletons 


I 


Th«  typical  axcuaa  ia  that  AM  waa  diatractad  at  that  momant  by  aoma  avan  mora  intaraatinf  taak. 
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wer»’  crested.  AM  literally  collected  Example$(Anythlng)  and  randomly  chose  members  y 
frori!  that  big  and  varied  assortment.  Other  set  operations  were  run  just  to  provide  some 
additional  examples  of  Sets.  Not  every  attempt  wv^  suctessful,  of  course:  one  heuristic  said 
to  find  some  examples  of  Lists,  and  then  use  the  Viax.'  facet  of  Sets  to  transform  them  into* 
sets.  Unfortunately,  there  were  no  examples  of  any  other  kinds  of  structures  at  the  moment, 
so  this  rule  failed.  After  about  20  cpu  seconds,  the  time  and  space  quanta  were  both  Just 
about  exhausted.  Roughly  30  examples  of  sets  were  found. 

In  similar  ways,  examples  were  found  for  Bags,  Lists,  Osets,  and  Ordered-pairs.  Examples 
of  structural  operations  were  found  "incidentally"  as  above  -  to  aid  in  producing  examples 
of  a  certain  kind  of  structure.  Occasionally,  the  primary  task  (the  one  selected  from  the 
agenda)  was  to  find  examples  of  a  given  operation.  In  that  case,  AM  spent  a  great  deal  of 
time  Just  on  that  one  operation.  For  example,  consider  Set-union.  When  AM  got  around  to 
filling  in  some  examples  for  it,  many  examples  already  existed  under  the  concepts  of  Sets. 
Bags,  and  Bag-union.  One  way  to  get  examples  of  Set-union  was  by  analogy  to  Bag-union. 
This  caused  some  slightly  erroneous  entries  to  be  found  (e.g.,  {a,b,c}u{a,c,e}'-{a,a,b.c,c,e}). 
Such  errors  were  soon  caught  and  corrected  when  the  task  "Check  examples  of  Set-union"  was 
chosen  from  the  agenda.  Similar  errors  and  corrections  occurred  when  AM  viewed  lists  as 
if  they  were  osets,  in  order  to  find  examples  of  osets. 

The  simple  development  described  above  was  broken  frequently  by  .some  new  concept  being 
created  and  found  to  be  very  interesting.  In  fact,  if  left  to  its  own  Judgment,  AM  would 
never  finish  the  above  process,  because  of  these  interruptions.  The  user  must  interrupt  and 
tell  it  to  ignore  new  concepts,  if  he  really  wants  AM  to  finish  finding  examples  of  all  those 
structures  and  operations. 

What  kinds  of  concepts  were  created,  and  why?  What  did  AM  do  to  investigate  them?  In 
general,  what  happened  was  this:  As  AM  collected  examples  of  a  concept  C,  the 
characteristics  of  its  efforts  (how  easy/hard  to  find  examples,  how  varied  they  were,  etc.) 
caused  certain  heuristic  rules  to  trigger.  Those  rules  then  caused  some  new  concepts  to  be 
created,  for  a  particular  reason.  AM  would  find  examples  of  them,  then  compare  the  results 
with  already-known  concepts. 

The  first  instance  of  this  process  was  when  AM  found  many  examples  of  sets  so  easily.  A 
rule  said  that  in  such  cases,  it  was  worthwhile  specializing  the  concept  Sets.  That  is,  make  a 
new  concept  which  would  have  fewer  examples.  One  way  AM  did  this  was  to  look  over  the 
Interest  features  of  all  generalizations  of  Sets,  pluck  a  couple  of  them,  and  conjoin  them 
onto  the  definition  of  Seu,  thereby  getting  a  definition  for  a  brand  new  concept,  called 
interesting-sets  or  INT-Sets  for  short.  The  feature  selected  first  was  the  following:  each  pair 
of  elements  of  the  structure  satisfy  the  same  rare  predicr.te  P,  for  some  P.  This  was 
previously  tacked  onto  the  Interest  facet  of  Structures.  Initially,  there  were  very  few 
predicates  knew't;  Constantly-True,  Constantly-False,  Object-Equality.  The  following  three 
types  of  INT-Sets  ,vnre  therefore  eventually  found: 

(i)  Sets  -  the  tame  concept  but  in  a  new  disguise  (for  any  pair  of  members  from  any 
set,  Conitantly-True  woild  return  True), 

(ii)  Empty-sets  -  an  already  known  concept,  but  now  with  a  new  definition  (for  any 
pair  of  members  from  any  set,  Constantly-False  would  never  return  True),  and 

(iii)  Singletons  '{a}*  (sets  for  which  ail  pairs  are  Equal  to  each  other). 


. 


\ 
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This  immediately  catapulted  the  empty  set  to  stardom.  Another  heuristic  rule  had  AM 

restrict  its  attention  to  the  predicates  which  were  not  trivial.  In  thi:  case,  both  Constant-  , 

True  and  Constant-False  were  no  longer  eligible.  So  the  only  remaining  INT-Sets  were 

those  for  which  all  pairs  of  elements  were  Equal.  AM  decided  to  explicitly  define  a  new  ] 

kind  of  set  having  just  that  definition.  This  became  a  specialization  of  INT-Sets,  called 

Equality-sets  or  Esets  for  short.  Since  the  empty  set  was  itlready  distinguished,  it  was 

decided  not  to  include  it  as  a  vaiia  Eset.  So  Esets  were  precisely  the  sets  we  would  call  : 

Singletons.  i 

Unfortunately,  the  set-operation  Union,  when  applied  to  Singletbns,  didn’t  always  yield 

singletons.  By  isolating  the  kind  of  sets  they  did  yield,  and  not  counting  the  few  extreme 

cases  when  they  yielded  singletons,  AM  discovert  the  concept  of  Doubleton:  a  set  with  i 

precisely  two  members.  Typically,  the  union  of  Singletons  was  a  doubleton,  their  ; 

intersection  was  the  empty  set,  their  Set-difference  was  a  singleton,  inserting  anything  into  a  1 

singleton  was  a  doubleton,  removing  something  left  a  singleton,  etc.  The  exceptions  were  all 

related  to  when  the  arguments  were  equal. 

Several  more  structural  concepts  were  created  and  explored  in  this  way,  besides  Singleton: 

Doubleton,  Tripleton,  Function  (an  operation,  all  of  whose  values  were  singleton  sets:  i.e.,  a  j 

single-valued  operation),...  In  general,  these  occurred  because  the  intial  primitive  concepts  1 

were  too  general,  too  easy  to  satisfy.  i 

During  its  investigation  of  Set-Intersection,  AM  noticed  that  sometimes  XnY-X,  and 
formalized  that  relationship  between  two  sets.  This  is  just  the  relation  we  normally  call  i 

Superset.  The  notion  of  Subset  also  was  discovered,  but  AM  never  had  much  interest  in  ‘ 

either  of  these  notions.  ^ 

When  AM  looked  for  examples  satisfying  the  predicate  Object-Equality,  abbreviated  Equal,  [ 

the  situation  was  just  the  opposite.  A  heuristic  rule  attached  to  'Active'  indicated  that  \ 

examples  could  be  found  by  randomly  instantiating  the  tv;o  arguments  of  Equal  with  a  pair  •' 

of  objects.  There  were  about  100  known  examples  of  stsuctures.  AM  gathered  them  into  one  i 

set,  and  then  repeatedly  chose  a  pair  of  them  to  run  Equal  on.  Thus  only  about  1%  of  the  I 

time  did  it  succeed  (did  Equal  return  the  value  T).  Another  heuristic  triggerred,  and  said  ;; 

that  in  such  cases,  it  was  worthwhile  trying  to  ge'ieralize  the  predicate  Equal.  A  new  task  to  k 

this  effect  was  added  to  the  agenda.  [ 

{ 

> 

Soon,  AM  selected  this  task,  and  tried  to  create  new  concepts  which  were  generalizations  of  [ 

Equal.  One  definition  of  Equal  was  a  transparent  recursive  one,  which  essentially  said  that  ^ 

X  and  y  were  Equal  iff  their  Cars  and  their  Crirs  were,  plus  it  had  a  base  step  that  asked  if 
both  arguments  were  empty  (in  which  case  Equal  returned  True),  or  if  precisely  one  i 

argument  was  empty  (in  which  case  Equal  returned  False),  or  if  both  arguments  were  ;; 

Identical  atoms  (True),  or  if  they  were  nonide/itical  atoms  or  only  one  was  an  atom  (False).  | 

^  (x,y) 

IF  X  and  y  are  identical  atoms,  THEN  return  True; 

ELSE  IF  either  x  or  y  is  not  a  list,  THEN  return  False; 

ELSE  IF  both  x  and  y  are  Null  lists,  THEN  return  True; 

ELSE  IF  only  one  of  x  or  y  is  Null,  THEN  return  False; 

ELSE  both  of  these  must  be  true: 

EquaK  CAR(x),  CAR(y) ) 

EquaK  C0R{x),  CDR(y) ) 


■'•Base 
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One  heuristic  rule  that  AM  possessed  suggested  that  this  could  be  generalized  in  a  small 
way  by  weakening  the  base  step.  A  couple  new  concepts' were  created  this  way,  but  they  all 
turned  out  to  be  trivial:  either  constantly  returning  T.  or  no  different  than  Equal  was. 

Another  heuristic  suggested  weakening  the  recursive  step.  One  way  to  do  this,  since  the 
recursive  step  is  a  conjunction,  is  to  remove  one  of  the  conjuncts.  The  rule  checks  to  ensure 
that  the  definition  is  still  recursive:  one  of  the  remaining  conjuncts  must  call  on  the  function 
itself.  In  this  case,  both  conjuncts  call  on  Equal,  so  AM  can  remove  either  one.  Two  new 
concepts  were  generated  in  this  manner.  For  example,  here  is  one  of  them,  which  AM 
named  “Equ0": 

X  (x,y) 

IF  X  and  y  are  identical  atoms,  THEN  return  True; 

ELSE  IF  either  x  or  y  is  not  a  list,  THEN  return  False; 

ELSE  IF  both  x  and  y  are  Null  lisiisi  THEN  return  True; 

ELSE  IF  only  one  of  x  or  y  is  Null,  THEN  return  False; 

ELSE  EquO(  CDR(x),  CDR(y) ) 

Note  that  the  base  cases  were  unchanged;  the  recursive  step  is  a  recursion  in  the  CDR 
direction,  but  no  longer  in  the  CAR  direction.  A  note  to  that  effect  is  placed  inside  the 
definition  of  EquO  itself,  as  a  comment.  Other  parts  of  EquO  can  be  filled  in  easily;  it  is  a 
generalization  of  Equal,  it  is  an  example  of  a  Predicate,  its  domain/range  is  the  same  as 
Equal,  its  worth  is  initially  set  a  little  higher  than  Equal's,  etc. 

This  predicate  maps  down  two  lists,  one  element  at  a  time,  and  returns  True  iff  they  both 
become  empty  at  the  same  moment  That  is.  iff  they  have  the  same  length.  In  fact,  we  can 
assume  that  the  user  Interrupts  and  orders  AM  to  rename  EquO  as  "Same-length". 

The  other  new  generalization,  called  "Equl",  examines  the  CAR’s  (l.e.,  the  first  elements)  of 
a  pair  of  lists,  and  returns  True  if  they  were  identical  atoms;  if  they  were  both  lists,  it 
recurses  on  those  two  lists.  A  human  LISP  programmer’s  interpretation  is  as  follows;  when 
the  two  lists  were  written  out  in  standard  notation  (u.ung  parentheses),  ail  the  initial  left 
parentheses  match,  and  the  first  non-left-parenthesis  entity  of  each  list  also  matches. 
Although  this  is  isomorphic  to  numbers  again^  AM  didn’t  pursue  this  concept  very  far. 
Most  of  the  examples  of  lists  AM  knew  abwt  were  only  l-level  deep,  although  they  varied 
significantly  in  length.  If  it  had  been  otherwise,  AM  might  have  developed  Equl  into 
Same-length,  and  lost  interest  in  EquO.  As  it  was,  the  Worth  of  this  concept  Equl  slowly 
declined,  and  very  few  tasks  involving  it  bubbled  up  to  the  top  of  the  agenda. 

The  concept  of  Same-length  will  form  the  springboard  for  the  development  of  caidinality,  a 
tale  which  is  related  in  a  little  while.  Before  moving  on,  let’s  mention  a  couple  more  set- 
theoretic  activities  that  AM  did. 

Several  moderately  interesting  compositions  and  coalescings  were  done  to  set-operations. 
(Actually,  to  structure-operations).  First  let’s  discuss  the  coalescings  of  operations  f(x,y.)  into 
new  operations  f(x,x)  -  where  a  pair  of  arguments  is  made  to  coincide.  Coalescing  Insert 
(insert  S  into  itself)  led  to  an  operation  which  always  seemed  to  give  a  bigger  set  than  it 
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Started  with.  This  led  AM  to  the  finitely-true  conjecture  that  a  set  is  never  a  member  of 
itself.  The  conjecture  was  rediscovered  when  the  coalescing  of  Delete  seemed  to  produce  the 
identity  operation  (Deleting  S  from  S  never  seemed  to  change  the  value  of  S). 

Coalescing  Intersect  also  gave  the  identity  operation;  this  showed  that  SnS»S  (apparently). 
Similarly  for  Union.  Coalescing  "Compose"  gave  a  new  operation  of  some  value:  the  notion 
of  composing  f  with  f.  When  this  was  applied  to,  e.g.,  Intersect,  it  created  the  new  operation 
Intersectointersect,  which  took  3  arguments  and  formed  their  common  intersection.  By 
forming  this  in  two  ways  -  (xny)ni  and  also  xn(yni)  -  AM  noticed  that  they  were  the  same, 
that  the  order  didn’t  matter.  Since  xnx  had  already  been  shown  to  be  the  identity  operation, 
multiple  occurrences  of  an  element  in  an  intersection  don’t  make  any  difference.  Finally,  AM 
had  constructed  xny  and  ynx  as  two  separate  operations,  and  then  found  them  to  be 
equivalent.  Taking  all  these  results  together,  it  was  possible  to  view  n  as  taking  a  i€t  of  sets 
as  its  argument,  and  forming  the  intersection  of  ail  those  sets.  Thus  n({  {a,b,c},{c,g,h},{a,c,e} 

D-W- 

Some  valuable  compositons  were  formed.  By  forming  xn(yuz)  and  (xny)u(xnz)  as  two 
separate  compositions,  AM  found  their  equivalence  via  experimental  data.  This  was  one 
case  where  the  Intuition  functions  had  given  AM  an  unfair  advantage,  since  the  Venn- 
diagram  abstract  representation  for  sets  suggested  this  relationship  explicitly.  When  the 
intuition  was  removed,  the  discovery  was  made  much  more  valid.  All  of  de  Morgan’s  laws 
were  discovered  in  this  manner,  incidentally.  Several  special  cases  were  singled  out, 
involving  empty  sets,  singletons,  and  doubletons.® 

The  compositon  Deleteolnsert  is  not  quite  so  trivial  as  one  might  think;  it  takes  a  slructre  S, 
inserts  an  element  e,  and  then  removes  element  e.  This  simple  operation  can  be  used  to  test 
the  type  of  structure  that  S  is;  it  never  alters  a  Bag  or  a  List,  but  it  does  alter  Sets  and 
Osets.  Orthogonally,  InsertoDelete  always  alters  Lists  and  Osets,  but  can  leave  Bags  and 
Sets  unchanged.  The  first  composition  tests  for  multiple  elements,  and  the  second 
composition  tests  for  order.  .AM  defined  both  of  these,  and  (at  least  pai*'  tlly)  perceived  their 
abilities  to  distinguish  structural  types. 

Operations  formed  by  switching  the  two  arguments  of  an  old  eperauon  wer;  t-^a  'ginally 
Interesting.  They  helped  pin  down  the  true  nature  of  what  kind  oi  argument  the  operation 
should  be  considered  as  taking.  For  example,  Union(x,y)«Union(y,x),  so  Union  v  in  t:\ke  an 
unordered  collection  of  sets  as  its  argument,  and  form  their  union.  Similarly,  '  'e  $e«  that 
Insert(x,y)  is  in  general  quite  different  from  Ins£rt(y,x).  When  AM  tries  to  see  v;hat 
.ivariants  exist  for  this  operation,  it  can  notice  only  ih;.  the  trivial  constraint  x-y  is 
iijfficient  to  cause  the  order  of  arguments  not  to  matter.  If  it  had  the  concept  of  th>i'  LISP 
function  "COUNT",  which  counts  up  all  the  storage  space  used,  or  "FLATTEN",  which 
removes  all  parentheses  from  a  list  structure,  then  AM  would  notice  that  the  COUNTV  of 
boi>  forms  of  Inserting  were  \  ual  in  number,  and  that  their  FLAT!  EN’s  were  equal  sets 
of  elr-  nents. 

Looxinj;  for  invariants  is  one  favorite  pastime  of  AM.  In  general,  two  operations  f  and  g 
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will  not  coincide.  AM  wants  to  find  a  unifying  operation  U,  such  that  U(f(x))-U(g(x))i  or, 
AM  tries  to  find  a  U  such  that  f(U(x))-g(U(x)).  This  is  of  course  the  idea  mathematicians 
normally  refer  to  as  homomorphism.  AM  calls  this  process  Canonizing.  Canonizing  the  two 
orders  of  Insert  (x  into  y,  and  y  i«to  x)  would  hopefully  find  the  operation  U-FLATTEN 
or  U-COUNT.  Canonizing  Same-length  will  cause  the  operation  of  Length  to  be 
synthesized.  Canonizing  the  notion  of  angles-equal-in-measure  were  the  operations  we 
normally  call  rigid  motions  in  the  plane. 

Let’s  pick  the  main  thread  of  development  again,  before  we  lose  it  entirely.  Earlier,  the 
operation  "Same-length"  was  synthesized,  as  a  generalized  form  of  the  predicate  which  told 
when  two  structures  were  equal.  Same-length(x,y}  is  True  iff  x  and  y  are  two  list  structures 
which  have  the  same  length  (i.e.,  the  same  number  of  elements).  This  new  predicate  was 
examined  by  AM,  and  sure  enough  it  let  many  more  pairs  of  random  objects  return  True 
than  Equality  did,  yet  it  didn’t  let  too  high  a  percentage  through:  just  about  lOX.  This  made 
AM  want  to  find  an  invariant,  a  canonizing  function  f,  which  put  any  given  list  structure  x 
into  a  standard  form  f(x),  satisfying: 

Sam«>l«ngih(x,y)  iff  Equal(f(s),f(y)) 

That  is,  X  and  y  would  have  the  same  length  precisely  when  the  standard  forms  of  x  and  y 
were  identically  equal  to  each  other. 

AM  had  to  synthesize  this  function  f,  step  by  step.  First  it  performed  some  experiments,  and 
found  that  Same-length  didn’t  care  what  the  type  of  its  arguments  were.  If  a  certain  Set  S 
did/didn’t  satisfy  Same-length(R,S),  then  the  same  result  would  obtain  If  S  were  replaced  by 
Viewing  S  as  a  list,  or  as  a  bag,  or  as  an  oset.  Thus  the  standard  form  of  a  structure  could 
be  fixed  as  one  specific  type.  But  which  should  it  be  (bag,  set,  list,  oset)?  To  find  out,  AM 
ran  two  more  experiments.  The  first  experiment  was  to  see  whether  Same-length(R,S)  was 
affected  when  the  order  of  elements  inside  R  were  changed.  This  turned  out  to  be  negative. 
So  R  might  as  well  be  an  unordered  structure:  bag  or  set.  The  next  experiment  had  AM 
decide  whether  or  not  multple  elements  Inside  R  would  affect  the  value  of  Same-length(R,S). 
This  turned  out  positive,  so  multiple  elements  had  to  be  taken  into  account.  The  canonical 
type  of  argument  was  thus  either  bag  or  list.  Together,  the  two  experiments  indicated  that 
the  type  had  to  be  Bag.  So  the  canonizing  function  f  would  first  convert  any  argument  R  to 
a  bag.  A  note  tacked  onto  the  Same-length  concept’s  definition  said  that  this  concept  didn’t 
look  at  the  Car’s  or  value-cells  of  its  arguments.  That  would  mean  that  they  should  all  be 
replaced  by  some  fixed  value,  like  T.  This  checked  out  experimentally.  So  f  should  replace 
each  element  in  the  structure  R  by  the  constant  T.  The  final  operation  f  synthesized  was: 

f(R)  ■  ReplKfeKh-element'byoT  ( Convart'lo'-bag  (R) ). 

This  operation  would  convert  {a,  (b,c,{d},e,e),  q}  into  (T,T,T).  The  set  of  standard  forms  for 
bags  was  called  Canonical-bags,  and  renam^  by  the  user  as  Numbers.  The  canonizing 
operation  f  was  called  Length,  by  the  user,  since  it  converts  any  structure  into  a  "number" 
which  represents  the  length  of  that  structure: 

Sarn«o|«nglh(R,$}  iff  Equ«l(Ltiigth{R},Lenglh(S}} 

AM  now  made  explicit  an  important  analogy:  bags^numbers,  equal4«same-iength,  bag- 
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operations*«[tho5e  same  operations,  restricted  to  canonical-bags].  Several  minutes  were  spent 
developing  these  restricted  bag-operations.  The  old  operation  of  inserting  a  bag  S  into 
itself  provided  a  cute  way  to  "add  I"  to  any  number.  The  Bag-union  operation  union 
turned  into  what  we  call  Addition: 


Bif-onion(  (T  T  T)  (T  T) )  -  (T  T  T  T  T)  means  "3^2-5",  in  unary. 

TIMES  was  discovered  in  four  ways,  as  discussed  previously  in  the  thesis. 

The  intersection  of  two  "numbers"  is  the  operation  we  call  MINIMUM: 
lnt«rsect{(T  T  T)  (T  T  T  T))  •  (T  T  T)  just  says  *Minimum(3,4)-3". 

AM  likes  to  find  inverses,  and  the  inverse  of  these  operations  turned  out  to  be  the  ones  we 
call  subtraction,  factoring,  division,  less-than,  etc. 

AM  likes  coalescing,  and  some  important  operations  were  created  that  way,  too:  Add<x,x)  is 
Doubling;  Times(x,x)  is  Squaring:  the  inverses  of  those  were  Halving  and  Square-rooting 
(both  restricted  to  natural  numbers). 

AM  worried  about  which  numbers  could  be  halved  or  square-rooted,  and  this  led  to  the 
creation  of  the  concepts  Even-numbers  and  Perfect-squares.  When  asking  when  a  number  z 
can  be  the  result  of  a  multiplication  involving  x,  AM  derived  the  notion  of  Divides; 
analogously,  AM  defined  the  relation  which  meant  that  Add  of  x  and  something  else  could 
yield  z.  That  relation  is  Just  "S",  and  was  called  LECtby  the  user.  AM  noticed  many  simple 
properties  of  inequalities. 

AM  likes  composing  and  reversing  args,  and  thereby  figured  out  that  most  arithmetic 
operations  like  Add  and  Times  take  a  bag  of  numbers  to  work  on. 

TIMES*’  was,  as  we  said,  factoring:  given  n,  find  ail  possible  bags  of  numbers  (>1)  whose 
product  yielded  n.  The  identity  of  Times  ("1")  was  intentionally  excluded,  since  its  presence 
in  any  quantity  would  not  affect  the  result  of  Times.  AM  soon  asked  itself  about  numbers 
with  extreme  or  unusual  factorings. 

Primes  were  found  In  this  way,  as  was  Goldbach’s  conjecture.  The  example  in  chapter  2 
went  into  this  in  great  detail.  A  large  number  of  spurious  analogous  concepts  were  created 
and  studied,  to  no  avail.  For  example,  AM  asked  itself  about  numbers  which  could  uniquely 
represented  as  the  sum  of  two  primes.  As  another  example,  AM  defined  the  concept  of 
Square-roots-of-prlmes,  which  of  course  was  not  a  winner. 

The  unique  factorization  of  any  number  into  primes  was  perceived  quite  naturally,  but 
many  seemingly  elementary  concepts  were  left  undiscovered.  AM  never  defined  gcd  (the 
greatest  common  divisors)  onits  own;  it  was,  however,  possible  to  guide  it  to  discovering  that 
concept. 

The  task-by-task  trace  in  the  next  section  closes  with  the  restriction  of  addition  to  primes; 
I.e.,  p+q-r  for  primes  p,q,r.  This  situation  only  occurs  when  p  (say)  is  2,  and  q,r  form  a 
prime  pair.  That  trace  will  of  course  delve  into  concepts  not  mentioned  above,  and 
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conversely  AM  happended  to  miss,  onthat  run,  some  of  those  mentioned  in  this  section. 
Finally,  both  sections  omit  mention  of  several  interesting  acitivites  AM  performed: 
maximally-divisibles  and  all  the  geometric  concepts,  for  example.  The  line  must  be  drawn 
somewhere.  The  frustrated  reader  should  try  AM  himself,  and  watch  its  progress. 

Appendix  5.2.  A  *Nice’  Task-by-task  Trace 

Now  that  we’ve  discussed  this  development  at  a  fairly  high  level,  let’s  list  what  AM  did  task 
by  task.  The  lines  below  give  a  highly  compressed  "trace"  of  AM’s  sequence  of  behavior. 
The  tasks  in  the  "best  run"^  are  listed  in  order,  and  numbered.  For  each,  I  have  condensed 
AM’s  printout,  usually  retaining  some  of  the  reasons  AM  had  for  atempting  the  task,  the 
methods  AM  used,  the  final  outcome,  and  occasionally  a  bit  of  commentary  (in  italics).  The 
task  numbers  below  correspond  to  the  numbering  used  in  Section  6.1,  starting  on  page  1 15. 

««  TatK  1  «$  Fill  in  axamplet  of  Compote,  because  Compose  is  highly-rated  and  no  examples 
of  Compose  are  known  yet.  Several  heuristics  relevant,  but  none  succeeded.  One  of  them, 
heuristic  rule  number  122,  failed  because  no  operations  could  be  found  which  had 
examples.  Also  important  at  this  moment  was  heuristic  rule  number  156;  see  Appendix  3. 

«*  Task  2  «*  Fill  in  ox^mplot  of  Sat'union,  because  Set-union  is  highly-rated,  and  no 
examples  of  Set-union  are  known  yet,  and  if  some  examples  had  been  known  earlier  then 
AM  would  have  been  able  to  Fill  in  examples  of  Compose.  Several  heuristics  relevant,  but 
again  none  succeeded. 

**  Tatk  3  i»  Fill  in  oxamplot  of  Soli,  because  Sets  is  highly-rated,  and  no  examples  of  Sets 
are  known  yet,  and  if  some  examples  had  been  Known  earlier  then  AM  would  have  been 
able  to  Fill  in  example  of  Set-union,  and  AM  was  recently  working  on  Domain(Set-union). 
and  AM  was  recently  working  on  Range(Set-union).  Several  heuristic  rules  are  relevant. 
After  rule  31  suaeeded,  so  could  many  other  techniques  (e.g.,  rule  38).  In  fact,  it  was  so 
easy  for  AM  to  crank  out  one  example  of  a  set  after  another,  that  rule  45  triggered. 

«*  Task  4  **  Fill  in  specializations  of  Sets,  because  it  was  very  easy  to  find  examples  of  Sets, 
and  no  specializations  of  Sets  exist  yet,  and  Focus  of  Attention.  Many  ways  of  creating  new 
concepts,  new  specialized  forms  of  Sets,  were  relevant.  AM  created  INT-Sets,  defined  as  "X 
(S)  S  is  a  set,  and  all  pairs  of  members  of  S  satisfy  the  rare  predicate  P".  AM  also  created 
Bl-Sets,  defined  as  *  X  (S)  S-{},  or  else  both  CAR(S)  and  CDR(S)  are  Bl-Sets."  The  former 
specialization  will  lead  to  Singletons,  the  latter  deals  with  nests  of  braces  (sets  with  no  atomic 
elements)} 

**  Tack  5  »«  Fill  in  oxamplos  of  INT«$ait,  because  any  such  item  will  automatically  be  an 
interesting  exantple  of  a  Set,  and  INT-Sets  was  just  created,  and  no  examples  of  it  are 
known  yet.  Very  slowly,  6  examples  were  found  via  inference,  and  then  7  more  were 
produced  quickly  via  more  brutish  methods.  After  eliminating  duplicates,  only  3  remain:  {}, 
{A},  and  {B}.  In  each  case,  the  predicate  P  was  "Equal",  and  the  worth  of  the  concept 
Equal  was  raised  slightly. 

^  Actually,  $  coupit  *vtry  geod*  run*  !i*v()  bean  (pptndtd,  but  NOT  tplietd  toiethcr  to  tho  bonofH  of  AM. 

^  R*«*lt  that  italics  aif  nHy  tha  author'*  comatanta,  and  contain  information  which  AM  —  and  probably  th*  uaar  a*  wall  — 
could  not  hav*  known  at  th*  tim*. 
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Twk  6  **  Chffck  all  axamplas  of  INT*Sots,  because  many  unchecked  examples  are  present 
there,  and  Focus  of  Attention.  All  three  examples  were  confirmed.  No  surprises  were 
encountered.  One  of  the  three  INT-Sets  turned  out  to  be  an  Empty-structure,  but  no 
conjecture  was  actually  formulated  at  this  time. 

**  Task  7  **  Chock  all  oxamplos  of  Sots,  because  many  were  recently  found  and  never 
checked,  and  the  previous  task  dealt  with  a  specialization  of  the  Sets  concept.  One  small 
modification  had  to  be  made  to  one  of  the  sets  (namely,  {b,a,b}-«{a,b}).  Based  on  empirical 
evidence,  AM  hypothesizes  that  Sets  may  really  be  no  more  specialized  than  Unordered- 
strucs.  AM  will  reserve  Judgment  until  after  it  has  tried  to  find  examples  of  Bags.  (See 
heuristic  rule  108,  page  248.)  Similarly,  AM  considers  the  possibility  that  all  Non-multiple- 
elements-strucs  are  really  Sets  as  well.  Before  relying  on  this  flimsy  conjecture,  AM  must  first 
took  for  examples  of  Osets,  and  see  if  tht>j  are  really  also  Sets. 

*«  Titk  8  ««  Fill  in  •xamples  of  Bi{s,  because  no  examples  of  Bags  exist  yet,  and  to  help 
settle  the  question  about  all  unordered  structures  being  sets.  10  examples  found.  None  are 
sets. 

»*  Talk  9  »«  Fill  in  speciilizations  of  Ba{s,  because  it  was  very  easy  to  find  examples  of  Bags, 
and  no  specializations  of  Bags  are  known  about  yet,  and  Focus  of  Attention.  Many  ways  of 
creating  new  concepts,  new  specialized  forms  of  Bags,  were  relevant.  AM  created  INT-Bags, 
defined  as  "X  (S)  S  is  a  Bag,  and  ait  pairs  of  members  of  S  satisfy  the  rare  predicate  P". 
AM  also  created  BI-Bags,  defined  as "  X  (S)  S»(),  or  else  both  CAR(S)  and  CDR(S)  are  BI- 
Bags." 

**  Twk  10  ««  Fill  in  oxampitt  of  Oioti,  because  none  exist  yet,  and  to  help  settle  the  question 
about  all  nonmult-strua  being  sets.  13  distinct  examples  found.  None  are  sets. 

«*  Tuk  11  »*  Chock  txampitt  of  Oaola,  because  many  unchecked  examples  of  Osets  exist  on 
Osets.Exs,  and  Focus  of  Attention.  All  confirmed.  The  prioirty  rating  of  this  task  is  not 
high  enough  to  inspire  the  creation  of  any  new  concepts.  One  weak  conjecture  made:  all 
ordered  structures  are  Osets.  Will  settle  this  by: 

**  Tatk  12  **  Fill  in  oxamploi  of  Liata,  because  none  exist  yet,  and  to  help  settle  the  question 
about  all  ord-strucs  being  osets.  29  examples  found.  None  are  osets. 

««  Tatk  13  Chock  oxamploi  of  Liiit,  because  many  unchecked  examples  of  Lists  exist,  and 
Focus  of  Attention.  All  confirmed.  Nothing  special  noted  or  motivated. 

♦♦  Tatk  14  ♦*  Fill  in  oxamploi  of  All-bui-firii,  because  no  such  examples  exist  yet,  and  AM 
was  Just  working  on  Domain(All-but-first),  and  AM  was  recently  working  on  Domain(All- 
but-first).  The  similarity  of  the  last  two  reasons  escaped  AM,  and  points  up  one  of  its  flaws. 
AM  is  swayed  by  the  presence  of  a  slightly-different  reason  just  as  much  as  by  a  very-different 
supporting  reason.  There  Is  no  processing  done  on  the  reasons.  Choosing  this  task  represents 
a  radical  shift  of  attention  for  AM.  32  examples  found,  mostly  by  applying  All-but-first  to 
the  examples  of  Lists  and  Osets  already  known. 
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Task  15  «*  Fill  in  •xamplai  of  Alhbui-last,  because  none  exist  yet,  and  AM  was  recently 
working  on  Domain(AII-but-last).  Another  poorly-motivated  task.  Luckily  for  AM,  the  user 
erroneously  believes  that  this  task  is  also  chosen  to  be  symmetric  with  the  last  task.  45 
examples  found. 

»*  Ta«k  16  Fill  In  apecialixationt  of  AII-but*lMl,  because  it’s  so  easy  to  find  examples  of  it, 
and  no  specializations  exist  yet,  and  Focus  of  Attention.  The  syntactic  methods  AM  can 
bring  to  bear  are  not  enough  to  produce  any  meaningful  new  concepts,  and  this  task  Fails. 
Failure  of  a  task  causes  ‘Focus  of  Attention'  to  go  away  f dr  one  cycle. 

*»  Task  17  **  Fill  in  txamplas  of  Liii-union,  because  none  exist  yet.  Another  wild  shift  of 
attention.  3  examples  derived  by  symbolic  manipulation  of  the  definition  facet  of  this 
concept,  then  22  more  derived  using  less  inferential  techniques  (some  were  garnered  by 
running  List-union. A Ig  itself  on  the  early  examples!). 

**  Task  18  ««  Fill  in  •xamplat  of  Projl,  because  none  exist  yet.  26  found. 

««  Task  19  Chock  oxamplos  of  All-bui-firsi,  because  many  were  recently  found  but  not  yet 
confirmed.  All  check  out.  This  task  has  no  repercussions  (new  concepts,  tasks,  etc.). 

*»  Task  20  -t*  Chock  oxamplos  of  All-but-last,  because  many  unchecked  examples  exist  on  the 
Examples  facet  of  All-but-last.  All  confirmed,  with  no  repercussions.  If  the  initial  Worth 
values  of  All-but~frst  and  AlUut-last  are  set  high  enough,  AM  defines  a  new  concept  at  this 
point,  a  new  kind  of  ordered  structure:  \  (S)  All-buUfirst(S)  -  AlLbut-last(S).  In  fact,  the  only 
kind  of  Osets  Included  herein  are  those  which  are  singletons  or  empty.  Among  lists,  it  also 
includes  those  which  contain  just  one  kind  of  element. 

t*  Task  21  »«  Fill  in  oxamplos  of  Prej2,  because  none  exist  yet.  26  found.  AM  will  typically 
quit  looking  for  examples  if  (i)  the  time  allocated  runs  out,  or  (ii)  the  space  allocated  is  used 
up,  or  (iii)  26  examples  are  found,  or  (iv)  151  attempts  to  find  examples  fail.  The  cosmic 
'significance  of  151  has  rarely  been  recognized  in  print.  Perhaps  15 1  is  more  comic  than  cosmic. 
Seriously,  these  numbers  must  be  changed  by  almost  an  order  of  magnitude  before  any  gross 
change  in  AM’s  behavior  is  noticed. 

**  Task  22  »*  Fill  in  oxamplos  of  Empiy-siructuros,  because  none  exist  yet,  and  the  Worth  of 
this  concept  was  increased  recently  (during  task  6).  Just  by  looking  at  the  examples  of 
Structures  (i.e.,  using  heuristic  rule  number  29),  AM  is  able  to  get  four  empty  ones:  {},  [].  <>, 
0;  i.e.,  the  empty  set,  oset,  list,  and  bag.  Although  some  of  these  are  rederived  in  other  ways, 
there  are  no  other  examples  ever  found.  This  paucity  triggers  a  rule  which  suggests  this 
task: 
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*«  TitK  23  **  Fill  In  {•ntrallzatlons  of  Empi/'-atructuras,  because  it  was  very  hard  -  but  not 
impossible  -  to  find  examples  of  that  concept,  and  Focus  of  Attention.  AM  examines  the 
definitions  of  Empty-structure,  but  can’t  find  any  recognizable  syntactic  pattern  it  knows 
about.  It  docs  find  (NOT  (SOME-MEMBER  S)),  and  tries  to  replace  SOME-MEMBER  by  a 
specialization  of  same,  but  none  is  known  to  exist.  If  the  user  initially  tells  AM  that  First- 
member  and  Last-member  are  specializations  of  Some-member,  then  AM  can  generalize  Empty- 
structures.  In  fact,  it  then  comes  up  with  what  is  equivalent  to  'Empty-struc  u  Unord-struc’. 
In  the  current  setup,  however,  this  task  fails.  If  AM  had  a  better  understanding  of 
constructive! destructive  operations,  it  might  have  defined  Structures-with-empty-CARs,  or 
perhaps  Structvres-with-empty-CDRs  (i.e.,  Singletons  again). 

**  Task  24  ♦»  Check  examples  of  Lisi-union,  because  many  were  recently  added  but  not  yet 
confirmed.  This  shows  the  mechanical  patience  that  a  'stack’  gives  you.  Since  no  higher- 
priority  task  has  been  suggested,  AM  attends  to  a  task  which  has  been  on  there  for  a  while. 

**  Task  25  *«  Chack  axamplas  of  Ba{8,  because  many  examples  and  a  couple  specializations 
exist.  A  few  small  modifications  had  to  be  made  (e.g.,  (A  C  B  A)  -» (A  A  B  C)). 

«*  Task  26  **  Fill  in  oxamplts  of  Bag’union,  because  none  exist  yet,  and  AM  was  just 
working  on  Domain(Bag-union),  and  AM  was  just  working  on  Range(Bag.union).  Note  the 
influence  of  heuristic  rule  numer  H.  This  task  proceeded  smoothly,  with  26  examples  being 
generated. 

*»  Talk  27  **  Check  axamplot  of  Proj2,  because  several  were  recently  found  and  not  yet 
chKked.  All  confirmed,  with  no  new  suggestions  generated. 

**  Task  28  ««  Fill  in  oxamplas  of  Sot-union,  because  none  exist  yet.  Again  we  see  rule  H  in 
action.  26  examples  found. 

*«  Task  29  **  Chock  oxamplos  of  Sot-union,  because  many  examples  have  recently  been 
found,  but  not  yet  checked,  and  Focus  of  Attention  (AM  just  worked  on  Set-union).  A  few 
patches  had  to  be  made.  Often,  Set-union(x,y)  was  equal  to  one  of  its  arguments.  AM 
therefore  defined  a  new  predicate  Eq-union(x,y)  which  is  True  iff  Set-union(x,y)MX.  The 
user  later  renamed  this  "Superset-of,  since  this  is  the  relationship  of  containment.  In  typical 
math  texts,  containment  is  introduced  long  before  union,  and  then  this  is  a  theorem:  "A^B  iff 
AysB^A".  But  AM  used  "u"  to  form  the  definition  of  "a". 

»*  Task  30  ♦*  Fill  in  oxamplos  of  Bat-insort,  because  none  exist  yet,  and  AM  was  recently 
working  on  Domain(Bag-insert),  and  AM  was  recently  working  on  Range(Bag-insert).  A 
saddeningly  stupid  move  for  AM.  It  should  have  rated  Superset  higher,  and  continued 
working  on  fr.  AM  has  no  trouble  finding  many  examples  of  insertions  into  bags. 
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Ti«k  31  **  Check  examplei  of  Bafimort,  because  many  examples  have  recently  been 
found,  but  not  yet  checked,  and  AM  Just  worked  on  Bag-insert.  All  examples  were 
confirmed.  This  operation  always  seemed  to  result  in  Nonempty  bags.  The  Domain/range 
facet  was  so  amended.  The  value  is  never  either  of  its  arguments,  but  there  is  no  concrete 
action  AM  must  take  in  such  a  situation,  no  compact  way  of  noting  that  anti-relationship 
(short  of  creating  a  full-blown  conjecture).  Restricted  to  singletons.  Bag-insert  never 
produces  a  singleton  or  an  empty  bag.  AM  defines  the  concept  of  a  bag  gotten  by  doing  a 
Bag-insert  on  a  singleton;  i.e.,  the  notion  of  a  douhieton  bag. 

Tack  32  **  Fill  In  examplai  of  Bag-intorioet,  because  none  exist  yet,  and  AM  was  recently 
working  on  Domaln(Bag-lntersect),  and  AM  was  recently  working  on  Range(Bag-intersect). 
2p  found  without  trouble. 

*«  Teak  33  «*  Fill  in  oxamplea  of  $ot*iniort,  because  hone  exist  yet.  Just  another  data- 
gathering  task,  building  up  AM’s  store  of  empirical  results. 

**  Task  34  *«  Chock  oxamplea  of  $ot*inaori,  because  many  examples  have  recently  been 
found,  but  not  yet  checked,  and  AM  Just  worked  on  Set-insert.  Applying  this  operation 
always  seems  to  result  in  Nonempty  seu.  Domain/range  so  amended.  The  value  is 
sometimes  Just  one  of  its  aiguments.  AM  defines  what  will  eventually  be  called  Set- 
membership  in  this  way:  X  (x,S)  Set-insert(x,S)-S.  This  is  not  the  only  important  result  here. 
aM  notices  that  Set-insert(x,S)  is  always  related  to  S  by  Superset-of.  That  is,  Superset- 
of(Set-insert(x,S),  S)  [for  any  x).  So  conjectured.  This  doesn’t  actually  suggest  anything 
marvelous  for  AM  to  do  next,  however.  Incidentally,  during  this  task  AM  also  defines  the 
concept  of  doubleton  set. 

»»  Task  35  »*  Fill  in  examples  of  Bag-deleie,  because  none  exist  yet.  NoU  that  ^working  on 
bags'  is  so  long  past  that  it  is  no  longtr  given  as  a  reason  for  this  task.  Able  to  generate  two 
examples  by  manipulating  definitions  supplied  with  Bag-delete,  then  a  couple  dozen  more 
were  generated.  Some  were  generated  by  accessing  already-known  examples  of  the  domain 
(i.e.,  the  entries  on  the  Examples  facet  of  the  Bags  concept)  and  then  running  Bag-delete.Alg 
on  them. 

*«  Task  36  «*  Fill  in  examplat  of  Bag-differanca,  because  none  exist  yet.  26  found.  This  is  a 
good  vantage  point  to  look  back  and  ahead,  and  notice  that  while  the  surrounding  tasks  are  all 
reasonable  data-gathering  forays,  the  order  in  which  they’re  performed  is  unimportant.  Later 
on,  AM  will  follow  threads  of  discoveries,  and  the  order  of  tasks  then  wilt  appear  very 
important. 

**  Task  37  ««  Check  examples  of  Bsg*infersect,  because  many  examples  have  recently  been 
found,  but  not  yet  checked.  So  many  examples  were  found  that  AM  will  entertain  the  idea 
of  creating  a  specialized  new  concept.  Since  the  intersection  of  two  randomly-chosen  bags 
was  often  empty,  AM  defined  the  concept  of  disjoint  bags:  X  (x,y)  Bag-intersect(x,y)-(). 

Task  38  «»  Fill  In  examples  of  $et*intersect,  because  none  exist  yet.  Many  found  very 
easily. 


Appendix  5 


AM  Oiteovery  in  Mithemitic*  «•  Heurittic  Stxrch 


-259- 


Titk  39  *«  Chvck  txampits  of  Sat-intorsoct,  because  many  examples  have  recently  been 
found,  but  not  yet  checked,  and  Focus  of  Attention.  3  small  modifications  had  to  be  made. 
This  time,  AM  noticed  that  the  intersection  of  two  sets  was  often  empty,  and  defined  the 
Disjoint-sets  concept.  AM  also  noted  that  xny  was  often  one  of  those  very  same  arguments, 
so  it  defined  the  relation  which  is  eventually  renamed  Subset:  >,  (x.y)  Sct-lntersect(x,y)«=x.  At 
the  moment,  AM  didn't  realize  that  there  was  any  connection  between  Subset  and  Superset. 
This  tic  came  much,  much  later  (Task  number  227  (qv.)  In  this  run). 

««  Task  40  «*  Fill  in  axampltt  of  Liit'intorsaet,  because  none  exist  yet,  and  the  interest  of 
Intersect  (the  general  concept  of  which  this  is  a  specialization)  has  recently  been  increased  a 
few  times.  26  found  without  incident. 

AM  is  bcrtd^i  Will  look  at  SutgosHypo  heuristics  for  new  things  to  do. 

If  ’‘Equality"  is  excised  at  this  moment,  AM  continues  the  preceding  line  of  inquiry 
for  a  while,  and  then  defines  Singleton-osets,  as  Osets  all  of  whose  members  are 
equal  to  each  other.  AM  notices  that  All-but-first  and  All-but-last,  restricted  to 
Singleton-osets,  always  yield  the  same  result,  namely  the  empty  oset.  AM  then 
"generalizes"  this  into  the  concept  which  is  all  the  osets  for  which  All-but- 
first(x)^All-but-last(x).  AM  then  turns  to  relationships  involving  Subset  and 
Superset,  followed  by  a  flurry  of  compositions  and  coalescings,  and  their 
investigation.  But  Equality  present,  so  AM  goes  off  on'  another  course  of 
exploration. 

«»  Task  41  **  Fill  In  examples  of  Equal,  because  Equal  has  recently  become  more  interesting, 
and  there  are  no  examples  known  yet.  Equal  became  more  interesting  gradually,  as  INT- 
Sets  were  define  and  liked,  Eq-union  defined  and  liked,  etc.  Once  chosen,  this  task  does  not 
go  smoothly.  By  inference  methods,  only  a  couple  examples  were  derived.  Later,  when 
heuristic  rule  number  122  was  run,  151  failures  were  encountered  and  only  2  successes.  This 
is  so  small  a  success  rate  that  a  heuristic  rule  strenuously  proposed  this  next  task,  with  a 
high  enough  rating  to  be  chosen  right  away: 


^  Of  court*  "bond”  it  fjti  whtt  AM  typt*  out.  It  r*fl*et*  th*  low  vtki*  of  tho  top  t«*k  on  th*  tgtndt,  tnd  ths  m*ag*r 
rttuitt  obtained  rtctntly  Pl*«*i  forgiv*  tht  tnthropomorphitm;  it  it  tnttnt  only  to  b«  “cut*",  not  mitl*adine. 
AM  h**  no  inttrnti  modtl  which  could  b*  cilltd  Ht  *«motiontl  tttt*“,  i»,  •!.,  PARRY  [Colby  73]  cltim*. 
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*«  Tatk  42  *«  Fill  in  g«n«ralizationt  of  Equal,  because  Equal  is  apparently  quitf  rarely 
satisfied,  and  there  are  no  entries  yet  on  EqualGenl.  Removing  one  of  the  two  conjoined 
recursive  calls  in  the  recursive  definition  given  for  Equal  caused  the  creation  of  Equal- 
except-CARs  and  Equal-except-CDRs.  The  first  predicate  tests  whether  x  and  y  have  the 
same  number  of  elements;  the  second  tests  whether  x  and  y  have  the  same  number  of  leading 
left  parens  and  the  same  first  atom  after  that  final  leading  left  parenthesis.  As  Knuth 
pointed  out,  both  of  these  concepts  are  valid  ways  of  defining  "numbers":  one  counts  the  number 
of  elements,  the  other  counts  the  number  of  leading  lift  parentheses.  But  most  structures 
which  AM  knows  about  are  just  simple  I-level  affairs.  Therefore,  Equal-except-CDRs  was 
allhost  always  the  same  as  "CAR(x}"CAR(y)".  So  AM  never  realized  this  duality.  If  it  had 
pushed  Bl-Sets  and  Bl-B&gs  further,  it  might  have.  Another  concept  created  here  is  far  more 
bizarre.  Instead  of  eliminating  one  of  the  two  conjoined  recursive  calls,  AM  replaced  the 
AND  with  an  OR.  The  new  concept  Genl-EqS  was  defined;  'X(x,y)  if  x  or  y  are  non-lists, 
then  EQ^x.y),  else  lGenl.Eq3(CAR(x),CAR(y))  fiR  Genl-EqS<CDR(x),CDR<y)J’  This  is  true 

if  X  and  y  have  the  same  length,  or  if  the  J^^  element  of  each  is  the  same  (for  any  j),  or  if 

the  j^h  element  of  each  has  the  same  length  (>0),  or  if  the  l’*'  element  of  the  J***  element  of 
each  is  the  same  or  has  the  same  length  (for  any  i,J),  or... 

««  Task  43  ««  Fill  in  •xamples  of  Equal-except*CARt,  because  this  is  a  new  generalization  of 
Equal  which  must  be  examined,  and  because  no  examples  exist  yet.  Only  10  examples  were 
found  before  the  time  quantum  was  exhausted,  but  this  was  still  many  more  than  were 
found  for  Equal  before.  The  user  now  renamed  this  concept  "Same-size".  A  whirlwind  of 
discovery  is  about  to  sweep  the  other  two  generalizations  of  Equal  out  of  the  top  spot  on  AM’s 
agenda  for  quite  a  while.  If  AM  accidentally  picked  another  of  these  to  work  on  before  Same- 
size,  only  a  small  amount  of  time  would  have  bem  spent  before  moving  on.  For  example,  AM 
is  unable  to  perform  Canonlze.Algs(Genl-Eq3JEqual),  so  that  would  be  a  dead-end  right  there. 

*«  Task  44  4t*  Apply  an  Algorithm  for  Canonize  fo  th«  argt  $ama*size  and  Equal,  because  a 
heuristic  rule^  explicitly  suggested  that,  and  there  are  no  known  examples  of  Canonize  yet, 
and  AM  was  Just  working  on  Same-size,  and  AM  was  recently  working  on  Equal,  and 
Same-size  was  recently  created,  and  Same-size  was  Just  renamed  by  the  user.  AM  performs 
several  experiments,  and  eventually  synthesizes  this  canonizing  function;  f(S)  takes  a 
structure  S,  converts  it  to  a  Bag,  and  replaces  each  element  by  T".  This  function  is  later 
renamed  "Size"  by  the  user.  AM  also  defines  the  set  of  canonical  stiuctures;  bags  of  T’s. 
The  user  renames  Bags-of-T’s  as  "Numbers". 

**  Twk  45  **  Restrict  the  Domsin/ringe  fKet  of  Bsg*ur.ion,  because  Bags-of-T’s  (called 
Numbers  now)  is  a  new,  interesting  specialization  of  Bags,  and  a  heuristic  rule  explicitly 
suggested  this,  and  Focus  of  Attention,  and  many  examples  of  B?.g-union  exist.  A  new 
operation  is  defined.  Number-union,  with  domain/range  entry  <Number  Number  -*  Bag>. 
This  task  used  less  than  one  cpu  second. 

*«  Task  46  Fill  in  oxamplos  of  Numbor-union,  because  it  is  recently-interesting,  and  it  was 
Just  created,  and  AM  was  recently  working  on  Domain(Number-uiiion).  Several  examples 
are  found.  At  this  point,  the  author  turned  on  a  tricky  LISP  printing  function,  which 
converted  each  bag  of  T’s  to  base-IO  exponential  notation  before  allowing  it  to  be  typed  out. 


^  Th«  rvli  to  w  m/nbor  213,  on  p«(i  270. 
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**  TmK  47  **  Chsck  th«  domiin/rang*  of  Numbor-union,  because  a  heuristic  rule  explicitly 
suggested  that  the  range  might  really  be  "Number",  and  AM  was  just  working  on  Number- 
union,  and  AM  was  recently  working  on  Domain(Number-union)  [i.e.,  working  on 
‘Numbers’].  In  fact,  what  the  heuristic  rule  suggested  purely  from  symmetry  desires  turned 
out  empirically  to  be  true:  the  value  of  Number-union(x,y)  did  always  appear  to  be  a 
Number  (a  bag  of  all  T’s).  The  result  of  this  was  to  amend  the  Domain/range  facet  of 
Number-union.  Although  AM  regards  this  uniformity  as  very  interesting,  it  has  no  direct 
suggestion  for  what  to  do  next.  The  user  renames  this  operation  "Add2",  since  it  takes 
precisely  2  arguments  (unary  numbers)  and  adds  them. 

»♦  TisK  48  *♦  Resfrief  the  domain/range  of  Bafintaraoct,  to  Numbers, /or  similar  reasons  as 
above.  After  again  noticing  that  the  intersection  of  2  numbers  always  seems  to  be  a  number, 
this  leads  to  the  operation  which  the  user  renames  "Minimum".  Since  the  pattern  of  tasks  is 
Restrict  Fill  in  examples  •*  Check  examples,  there  is  not  much  point  in  listing  all  three 
tasks  for  all  of  these  simple  restrictions.  Each  one  will  only  get  a  single  number  in  this  listing. 
Also,  since  the  reasons  for  these  restrictings  are  pretty  much  the  same,  they  won’t  be  repeated 
for  each  task  below. 

««  Task  49  «»  Restrict  tha  domain/ranga  of  Bag*dalata,  to  Numbers.  The  user  renames  this 
operation  "SUBl",  although  this  is  not  quite  accurate.  If  x  is  not  T’,  then  applying  this 
operation  to  x  and  N  (for  some  number  N  represented  as  a  bag  of  T’s)  will  not  alter  N  at 
all.  AM  does  not  possess  the  reasoning  abilities  needed  to  anticipate  this. 

**  Task  SO  ♦*  Restrict  the  domain/ranga  of  Bag-insort,  to  Numbers.  The  domain/range  entry 
is  changed  to  <Anything  Number  -♦  Bag>.  Renamed  Number-ins, ert.  Although  this  new 
operation  will  in  fact  change  a  number  N,  it  may  not  necessarily  change  it  into  a  number. 
The  last  operation,  SUB  I,  would  always  produce  a  number,  though  it  might  sometimes  fail 
to  change  N  at  all.  Here  is  the  sad  discovery  of  that  asymmetry  about  Number-insert: 

**  Task  51  »*  Chock  the  domain/rango  of  Nombor-insort,  because  a  heuristic  rule  explicitly 
suggested  that  the  range  might  really  be  "Number".  In  fact.  Its  quickly  seen  not  to  be.  This 
operation  is  lowered  in  worth,  and  never  touched  again.  Due  to  AM’s  imperfect  heuristics, 
the  worth  of  SUBl  is  slightly  higher  still  than  this  concept’s. 

**  Task  52  **  Roslrict  tho  domain/rango  of  Bag-difforonco,  to  Numbers.  After  again  noticing 
that  the  difference  of  2  numbers  always  seems  to  be  a  number,  this  leads  to  the  operation 
which  the  user  renames  "Subtract". 

**  Task  53  *«  Fill  in  oxamplos  of  Subtract,  because  none  exist  yet,  and  Subtract  was  just 
created.  Many  examples  are  found.  If  a  larger  number  is  "subtracted"  from  a  smaller,  the 
result  is  zero,  according  to  this  operation.  Thus  about  half  of  these  examples  have  the 
value  zero  (empty  bag).  AM  explicitly  defines  the  set  of  ordered  pairs  of  numbers  having 
zero  difference.  It  turns  out  that  (in  modern  terminology)  <x,y>  is  in  this  new  set  Iff  x  is  less 
than  or  equal  to  y.  So  the  user  renames  this  relation  "LEQ,". 

**  Task  54  ««  Fill  in  oxamplos  of  LEQ,  because  none  exist  yet,  and  LEQ,was  just  created.  26 
examples  found.  When  random  numbers  are  chosen,  the  success  rate  is  (as  we  wise 
observors  know)  a  little  over  50%.  This  is  very  nice  and  AM’s  estimate  of  the  worth  of  LEQ, 
rises. 
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««  Task  55  ««  Check  examplas  ct  LEQ,  because  many  examples  have  recently  been  found,  but 
not  yet  checked,  and  the  worth  of  LEQ,  has  recently  risen.  All  confirmed.  Unfortunately, 
AM  has  derived  Subset  but  not  Subbag,  else  it  might  have  noticed  that  (for  all  numbers  x  and 
y,  represented  as  bags  of  T’s)  xiy  iff  Subbag(x,y).  Then  AM  could  simply  observe  that  LEQ, 
tuas  just  Subbag  restricted  to  Numbers.  Locked  at  another  way,  AM  has  here  discovered  a 
restricted  version  of  the  concept  Subbag. 

**  Task  56  ««  Apply  algorithm  of  Coalasco  to  LEQ,  because  LEQ,  is  an  interesting  operation, 
recently  created,  many  examples  already  exist  for  it,  AM  just  worked  on  LEQ,  LEQ,  takes 
two  of  the  same  argument  (Numbers),  and  no  examples  of  Coalesce  are  known  yet.  The 
new  predicate  is  defined  as  X(x)  x^x.  But  this  is  Always-True,  so  AM  conjectures  that  each 
number  is  LEQ  itself,  and  forgets  the  new  coalesced  version  of  LEQ 

**  Task  57  *«  Fill  in  oxamnlM  of  Parallol*join2,  oecause  none  exist  yet.  IncU'ded  is  Parallel- 
join2(Bags,Bags,Proj2)  (initially  called  MJ2-BBP2),  which  turns  out  to  be  multiplication  of 
two  numbers  and  is  renam^  *TIM£$2"  by  the  user.  Also  included  is  Parallel- 
join2(Structures,Structures,Projl),  which  is  a  generalized  kind  of  Union  operation  (renamed 
"G-Unlon"  by  the  user).  Many  losers  are  also  created,  however,  like  Parallel- 
join2<Bags,Sets,Set-difrerence)). 

**  Task  58  ««  ,  •  €9.  Fill  in  and  check  examples  of  the  operations  just  createad.  Nothing  out 
of  the  ordinary  is  done  here,  just  the  routine  legwork  of  gathering  empirical  data  for  later 
use.  No  startling  conjectures  made. 

**  Task  70  »»  Fill  in  samples  of  Coalasco,  because  none  exist  yet.  One  shining  example  is 
Seif-compose,  which  takes  any  operation  F  (whose  range  is  also  a  domain  component)  and 
forms  FeF.  Another  example  is  Self-Insert,  which  takes  a  structure  S  and  Inserts  S  into  $. 
Also  created  were:  Self-Delete,  Self-Add,  Self-Times,  Self-Union,  etc.  A  different  kind  of 
coalescing  was  done  for  Parailel-replace2,  Parallel-join2,  and  Repeat2;  the  two  structural 
arguments  (the  hrst  and  second  arguments  for  each)  were  merged,  creating  three  new 
operations;  Coa-repeat2,  Coa-join2,  Coa-replace2.  Coa-rep)ace2,  for  example,  takes  a  single 
structure  S  and  an  operation  F,  and  replaces  each  member  x  of  S  by  the  value  F(x,S). 

«*  Tas';  71  **  Fill  in  •xamples  of  Salf-Deioio,  because  none  exist  yet,  and  Self-delete  was  just 
created.  Many  examples  are  found  quite  easily,  of  course,  except; 

**  Task  72  «»  Chock  oxampitt  of  Solf>Doioto,  because  many  examples  have  recently  been 
found,  but  not  yet  checked.  Since  trying  to  delete  S  from  S  will  never  work,  the  value  of 
Delete(S3)  it  just  S  all  the  time.  Self-delete  is  the  tame  as  the  identity  operation.  AM  is 
able  to  discover  this  and  state  it  as  a  conjecture,  obviating  the  need  for  bothering  with  this 
concept  ever  again. 

*•  Task  73  »*  Fill  in  oxamplot  of  Solf-Mombor,  because  none  exist  yet,  and  Self-member  was 
recently  created.  Only  negative  instances  are  found. 
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**  TwK  74  **  Check  eximples  of  S«lf*M«mbar,  because  many  examples  have  recently  been 
found,  but  not  yet  checked.  This  predicate  is  AIways-False,  empirically.  Replace  by  a  conjee: 
Self-Member  is  the  same  as  the  predicate  Always-False;  Member(S,S)-False  for  any  S.  Also, 
an  extra  algorithm  entry  is  added  to  MemberAlg:  Onct  Early  Quick:  \  (x,y}  if  x«y  than  Fals*. 
Also,  a  new  task  is  proposed,  t,o  generalize  Self-Member.  This  never  quite  rises  to  the  top 
of  theagenda,  so  it  is  never  attempted. 

«»  Teak  75  «*  Fill  in  •xampitt  of  Solf-Add,  because  none  exist  yet.  Many  found.  User  renames 
this  "Doubling",  after  he  observes  the  many  examples  which  are  produced. 

*«  Task  76  **  Chock  oxamplot  of  Coalosco,  because  many  examples  have  recently  been  found, 
but  not  yet  checked.  All  were  confirmed.  Some  were  already  proving  to  be  interesting,  so 
the  value  of  Coalesce  was  raised. 

»»  Task  77  ««  Chock  oxarnplot  of  Add2,  because  many  examples  have  recently  been  found, 
but  not  yet  checked.  All  were  confirmed.  Somewhat  disappointingly,  AM  didn’t  notice 
anything  special  about  Add2  at  the  time. 

**  Y^^k  78  *«  Fill  in  oxamplos  of  Solf*Timos,  because  none  exist  yet,  and  AM  recently  worked 
on  Isa(Self-Times)  [namely,  worked  on  Coalesce].  Renamed  "Squaring"  by  the  user.  20 
examples  found  before  quitting  due  to  lack  of  alotted  space. 

Task  79  **  Fill  in  oxamplos  of  Solf-Composo,  because  none  exist  yet.  Created  Add2oAdd2 
(two  versions;  Ad(^21  which  is  X  (x,y,z)  (x+y)+z,  and  Add22  which  is  x+(y+z)).  Similarly,  two 
versions  of  TIM£>2oTIMES2,  called  TIMES21  and  TIMES22.  Also,  two  versions  of 
ComposeoCompose.  .Some  losers  were  defined  as  well,  like  MembeKMember(x,y),z)  and 
Parallel-Join2(S,R,Parallei-join2(P,Q,F)}  -  the  latter  of  which  accepts  as  arguments  four 
kinds  of  structures  and  a  function  name.  Many  minimally-acceptable  concepts  were  created: 
CoatesceoCoalesce,  SquaringoSquaring,  DoubiingoDoubling,  etc. 

««  Task  80  **  Fill  in  oxamplos  of  Add21,  because  none  exist  yet,  and  Add21  was  just  created. 
This  operation  is  defined  as  X  (x,y,z)  (X4y)^z.  It  is  easy  to  find  examples. 

««  Task  81  **  Fill  in  oxamplos  of  Add22,  because  none  exist  yet,  and  Add22  was  recently 
created.  This  operation  is  defined  as  .X  (x,y,z)  It  is  easy  to  find  examples.  Most  of 

these  examples  are  gotten  from  the  "cousin"  concept  Add2i. 

**  Task  82  ««  Chock  oxsmploi  of  Squaring,  because  many  examples  have  recently  been  found, 
but  not  yet  checked.  All  confirmea.  I:  li  unfortunati  that  this  task  Intruded  into  AM's  line 
of  development. 

»«  Task  83  ««  Chock  oxamplos  of  Add22,  because  many  examples  have  recently  been  found, 
but  not  yet  checked.  During  this  process,  AM  notices  that  Add21  and  Add^  seem  to  be 
equivalent.  Before  conjecturing  this,  though,  AM  will  do  this  next  task; 


U“-  . 
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»«  Titk  84  *«  Ch«eK  txamplcs  of  Add21,  because  many  examples  have  recently  been  found, 
but  not  yet  checked,  and  this  task  was  spectfcally  suggested  while  AM  was  trying  to  check 
examples  of  Add21.  After  checking  these  examples,  Add21  and  Add22  still  appear 
equivalent.  AM  conjectures  this  and  merges  the  two  operations.  One  consequence  is  the 
boosting  of  the  worth  of  the  new,  combing  operation.  The  most  important  aftereffect  of 
this  is  that  AM  now  knows  that  the  "proper*  argument  for  a  genel-alized  kind  of  addition 
will  be  a  Bag,  not  a  List,  of  numbers.  This  new  kind  of  addition  is  called  Add,  to 
distinguish  it  from  Add2.  Add2  takes  a  pair  of  numbers  and  adds  them,  but  Add  accepts  a 
bag  of  numbers  and  forms  their  sum. 

**  Task  85  a*  Apply  alcorithm  for  invart  to  arfumont  'Add',  because  Add  is  interesting, 
recently  worked  on,  and  has  never  been  inverted,  and  there  are  no  examples  yet  for  Invert, 
and  the  worth  of  Add  has  recently  risen,  and  Add  was  just  created.  By  looking  at  those 
reasons,  we  see  why  some  semantic  processing  should  be  available.  T here  is  tremendous  overlap 
there,  and  the  task  is  not  really  supported  by  as  many  reasons  as  AM  thinks.  AM  defines  Inv- 
add(x)  (also  called  Add'')  as  the  set  of  all  bags  of  numbers  (>0)  whose  sum  is  x. 

«*  Task  86  ««  Fill  in  examples  of  TIMES2i,  because  none  exist  yet.  Defined  as  (x*y)*z.  Many 
are  found. 

«»  Task  87  *»  Fill  in  examples  of  TIME$22,  because  none  exist  yet.  Defined  as  x*(y*z).  Many 
are  found. 

**  Task  88  «»  Cheek  examples  of  TiMES22,  because  many  examples  have  recently  been  found, 
but  not  yet  checked.  As  with  Add,  earlier,  TIMES21  and  TIMES22  now  appear 
equivalent.  Before  saying  this,  AM  must  do  this  task; 

»*  Tssk  89  ««  Check  examples  of  TIME$21,  because  many  examples  have  recently  been  found, 
but  not  yet  checked,  and  this  task  was  specifically  suggested  while  AM  was  trying  to  check 
examples  of  TIMES22.  After  checking  these  examples,  TIMES21  and  TIM£$22  still 
appear  equivalent.  AM  conjectures  this  and  merges  the  two  operations.  One  consequence  is 
the  boosting  of  the  worth  of  the  new,  combined  operation.  The  most  important  aftereffect 
of  this  is  that  AM  now  knows  that  the  "proper*  argument  for  a  generalized  kind  of  product 
will  be  a  Bag,  not  a  List,  of  numbers.  This  new  kind  of  multiplication  is  called  TIMES,  to 
distinguish  it  from  TIMES2.  Notice  the  .'ame  property  held  true  for  Add2,  earlier.  AM 
sets  up  an  analogy  between  TIMES  and  ADD,  because  of  this  common  fact.  The  analogy 
itself  is  dose  to  what  mathematicians  call  the  concept  of  Semigroups. 

**  Tick  90  **  Apply  •Igoriihm  for  Invert  to  arfument  'TIMES',  because  TIMES  contains  a  new, 
promising  analogy,  and  the  analog  of  TIMES  has  been  Inverted,  and  TIMES  has  never 
been  inverted,  and  the  worth  of  TIMES  has  recently  risen,  and  TIMES  was  just  created. 
AM  defines  Inv-TIMES(x)  (also  called  TIMES"')  as  the  set  of  all  bags  of  numbers  (>1) 
whose  product  is  x.  AM  noted  that  TIMES*'  should  probably  be  analogic  to  Add*'. 


App«ndix  5 


AM  Ditcoviry  in  Mithtmttic*  it  Hturitiic  Stireh 


-305- 


**  Titk  91  «*  Fill  in  txampie*'  Parall«l>r«plae«2,  because  none  exist  yet.  /  could  kick  AM 
for  doing  this  now!!  The  priority  T>'.Hng  of  this  task  happened  to  place  it  above  all  the  others, 
including  those  with  extra  bonuss'  ■  .-Kause  of  focus  of  attention.  T  his  task  is  merely  diverting, 
not  harmful  in  any  lasting  sense,  ait  it  does  degrade  the  apparent  level  of  purposefulenss  of 
the  system.  Several  examples  of  Parailel-replace2  arc  found.  Included  are  Parallel- 
repIace2(Bags,Bags.Pi‘oj2)  (called  MR2-BBP2),  at.d  many  losers.  MR2-BBP2(S1,S2) 
replaces  each  element  in  SI  by  a  full  copy  of  the  whole  of  si  This  is  the  way  that  Skemp 
suggests  developing  the  notion  of  multiplication  —  and  in  fact  AM  will  (in  task  109)  derives 
an  operation  which  is  equivalent  to  TlMES2just  by  unioning  the  results  of  this  operation.  In 
task  127  AM  realizes  that  this  is  in  fact  just  multiplication,  and  merges  those  two  operations, 
concurrently  boosting  the  worth  of  that  combined  concept  greatly. 

at*  Task  92  »»  , .  107.  Fill  in  and  check  examples  of  the  operations  just  created.  Nc.hing 
really  worth  our  time  (or  AM's).  Sigh. 

**  Taak  108  *«  Fill  in  axamplaa  of  Compoao,  because  none  exist  yet.  It  is  very  easy  to  create 
new  compositions  -  most  of  them  losers.  Some  of  the  concepts  produced  (e.g.,  SiieoAdd*') 
were  valuable  but  were  lost  amid  the  mass  of  losers  (e.g.,  InsertoEqual).  Because  of  this 
fkxxl  of  poorly-motivated  new  concepts,  a  heuristic  triggers  which  has  AM  create  a  new 
specialization  of  Compose,  called  Int-Compose,  by  conjoining  onto  Compose.Defn  a  few  of 
the  features  from  Compose.Interest.  The  Worths  of  the  new  compositions  just  created  are 
all  lowered,  so  that  the  (future)  examples  of  Int-Compose  will  predominate.  T he  task  first 
considered  in  TASK  1  has  finally  bubbled  back  up  to  the  top  of  the  agenda,  and  has  proved  to 
be  quite  worthwhile. 

**  Task  109  «*  Fill  in  axamplAt  of  Inf-Composa,  because  none  exist  yet,  and  Int-Compose  was 
just  created,  and  any  example  of  Int-Compose  is  automatically  an  interesting  example  of 
Compose,  and  the  worth  of  Int-Compose  is  very  high.  The  two  chosen  operations  G,H 
must  be  such  that  ran(H)<dom(G),  and  ran(G)<dom(H);  both  G  and  H  must  be  interesting 
or  at  least  newly-created.  Well,  two  operations  recently  dealt  with  are  G-Union  and  MR2- 
BBP2.  Since  the  range  of  MR2-BBP2  is  ‘Bags  of  Bags’,  it  is  precisely  equal  to  the  domain 
of  the  newly-synthesized  operation  G-Union.  So  one  composition  considered  is  G- 
UnionoMR2-BBP2.  This  is  an  alternate  derivation  of  the  operation  of  multiplication.  Also 
included  are:  TIMESoSquaring,  CoalesceoCompose,  InSertoDelete,  Deleteolnsert, 
.Add2oTlmes2,  etc.  /.  Ithough  most  of  these  operations  were  never  investigated  very  much, 
they  are  much  better  than  the  random  compositions  produced  during  the  previous  task. 
This  seems  clear  even  to  AM,  even  before  studying  them  very  much. 

**  Twk  110  «*  ,  -  126.  Fill  in  and  check  examples  of  the  compositions  just  createad. 
Nothing  of  great  interest  until...: 

*♦  Task  127  *♦  Cheek  examples  of  G-UnionoMR2-BBP2,  because  many  examples  have  recently 
been  found,  but  not  yet  checked.  AM  discovers  that  this  operation  is  equivalent  to  MJ2- 
BBP2  (i.e.,  TIMES2).  Since  they  arose  in  very  different  ways,  the  worth  of  the  new,  merged 
concept  module  is  greatly  increased,  as  is  that  of  the  more  general  operation  TIMES. 
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**  Tpsk  128  **  Fill  in  •ximplti  of  Coa*r«p«at2.  because  none  exist  yet.  26  operations 
synthesised.  Foremost  among  them  was  Coa-repeat2(Bags-of-Numbers^dd2),  which  turned 
out  to  be  yet  another  derivation  of  muitipHcation!  Also  produced  was  Coa-repeat2(Bags-cf- 
Numbers, Times)  -  a  definition  of  exponentiaion.  Others  included  Coa- 
repeat2(Structures,Projl),  which  turns  out  to  be  the  same  as  First-element-of,  i.e..  CAR.  and 
Coa-repeat2<Structures,ProJ2},  which  turns  out  to  be  Last-element-of. 

**  Taak  129  **  Chack  tha  axamplai  of  Coa-rapaat2,  because  many  examples  have  recently 
been  found,  but  not  yet  checked,  and  Focus  of  Attention.  All  coniimed. 

**  Taak  130  **  Apply  algorithms  for  Invert  to  'Doubling',  Doubling  is  interesting,  and  it  has 
never  been  invert^.  The  result  is  called  "Halving"  by  the  user.  AM  decided  to  isolate  the 
domain  of  Halving  (the  range  of  Doubting).  Such  numbers  are  renamed  by  the  user  as 
"Evens".  Although  pltased  with  tht  usult  of  this  task,  it  was  somtwhat  jarring  in  the  context 
of  the  preceding  development. 

«*  Taak  131  ««  Fill  in  axamplaa  of  Salf'lnaart,  because  none  exist  yet.  AM  has  apparently  lost 
the  "thread"  of  a  development  and  is  wandering  around,  taking  care  of  only  moderately 
promising  tasks.  Many  examples  of  this  operation  are  found. 

««  Taak  132  **  Chock  oxamploa  of  Solf'lnsort,  because  many  examples  have  recently  been 
found,  but  not  yet  checked,  and  Focus  of  Attention.  Nothing  special  found.  The  result  is 
never  the  same  as  the  argument. 

««  Taak  133  *«  Fill  in  oxamploa  of  Coa-ropoat2-Add2,  because  none  exist  yet.  Many  found 
quickly,  but  at  a  large  cost  in  terms  of  storage  space. 

Taak  134  **  Chock  oxamploa  of  Coa’-ropoat2*Add2,  because  many  examples  have  recently 
been  found,  but  not  yet  checked,  and  Focus  of  Attention.  Confirmed.  AM  noticed  it’s  the 
same  as  TIMES.  Boost  the  worth  of  TIMES  even  higher,  far  above  that  of  any  other 
concept.  AM  will  stay  interested  in  TIMES  for  most  of  the  future  of  this  run. 

«*  Taak  135  *«  Apply  algorithm  lor  Invert  to  argument  'Squaring',  Squaring  is  interesting,  has 
never  been  inverted,  is  related  to  the  very  interesting  concept  TIMES,  is  related  to  the  very 
interesting  concept  Coalesce,  is  analogic  to  the  already-inverted  concept  Doubling.  AM 
defines  Inv-square(x)  as  the  number*  whose  square  is  x.  Renamed  by  user  as  "Square-root". 

««  Taak  136  *«  Fill  in  oxamploa  of  Squaro-root,  because  none  exist  yet,  and  Square-root  was 
just  created,  and  Square-root  is  related  to  the  very  interesting  concept  TIMES.  AM  spent 
quite  a  while  on  this  task,  and  only  aboui  10  examples  were  found  (discounting  duplicates). 
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»♦  T«tK  137  **  Fill  in  n*w  iltorithm#  for  Squiro-root,  because  Square-root.Algs  are  all  too 

slow,  and  have  been  called  on  a  great  deal  recently.  jIM  has  some  zero^^  order  rules  for 
improving  algorithms,  backed  up  by  a  marvelous  tactic;  ask  the  user.  In  this  case,  AM  asks 
the  user  for  a  better  algorithm,  and  he  supplies  one.  Of  course,  the  new  algorithm  is 
cornpletely  opaque  to  AM.  The  user  never  tells  AM  how  to  do  something  unless  it  had  a  (slow) 
way  to  do  that  thing  already.  One  fast  new  algorithm  now  exists. 

TwK  138  **  Check  examples  of  Square-root,  because  many  examples  have  recently  been 
found,  but  not  yet  checked,  and  Focus  of  Attention.  AM  is  plagued  by  the  frequency  of 
numbers  not  having  square-roots,  so  it  isolates  those  that  do.  It  defined  the  set  of  numbers 
having  a  square-root,  and  this  concept  was  renamed  "Perfect-squares"  by  the  user. 

»*  Task  139  *♦  Fill  in  axamplas  of  Coa-r«paat2-Tima8,  because  none  exist  yet,  and  this 
concept  is  related  to  the  very  interesting  concept  TIMES.  A  moderately  rational  thing  to 
investigate.  Examples  are  easily  found,  but  they  take  up  a  lot  of  space. 

*♦  Task  140  ♦*  Check  examples  of  Coa-repeaf2-Times,  because  many  examples  have  recently 
been  found,  but  not  yet  checked,  and  Focus  of  Attention.  Nothing  special  noticed, 
unfortunately  (this  is  exponentiation,  folks).  If  the  user  interrupts  and  tells  AM  that  this  is 
realty  interesting,  AM  soon  creates  the  specialization  of  it  defined  as  Expon(x,2),  and  then 

AM  notices  that  this  is  just  squaring.  I.e.,  x^-x'x;  the  base  tie  between  exponentiation  and 
multiplication.  On  its  own,  AM  doesn't  rate  Coa-repeat2-Times  high  enough  to  start  this  chain 
of  discoveries. 

♦*  Task  141  *♦  Fill  In  examples  of  Inv-TIMES,  because  none  exist  yet,  and  Inv-TIMES  is 
related  to  the  very  interesting  concept  TIMES.  Many  found. 

»»  Task  142  **  Fill  in  new  algorithms  for  Inv-TIMES,  because  Inv-TIMES.Algs  are  all  too  slow, 
and  have  been  called  on  a  great  deal  recently,  and  TIMES*'  is  related  to  the  very 
interesting  concept  TIMES,  and  Focus  of  Attention.  AM  asks  the  user,  who  supplies  a 
decent  recursive  algorithm  for  this  function. 

»»  Task  M3  *#  Check  examples  of  Inv-TIMES,  because  many  examples  have  recently  been 
found,  but  not  yet  checked,  and  Focus  of  Attention.  This  proceeds  along,  and  all  are 
confirmed.  A  heuristic  rule  notices  that  the  domain/range  is  <Number  -»  Sets-of-Bags-of- 
Numbers>;  it  searchs  for  an  operation  whose  Domain/range  facet  contains  an  entry 
(compatible  with)  <Sets-of-Bags-of-Numbers  -»  Number>,  and  fails;  next  it  looks  for  an 
operation  whose  dom/range  is  <Sets-of-Bags  -♦  Set  or  Bag>,  and  finds  that  G-Union  fills  the 
bill.  Therefore,  the  rule  suggests  the  following  task  (with  a  high  rating): 

♦♦  Task  144  *♦  Apply  Compose  algorithm  to  G-Union  and  Inv-TIMES,  because  the  three  concepts 
involved  are  Interesting,  related  to  TIMES,  and  this  task  was  specifically  suggested  by  the 
preceding  one.  The  composition  is  created,  as  specified  in  the  task.  This  new  operation  has 
domain/range  <Number  -♦  Set-of-Numbers>,  and  is  thus  given  a  higher  rating  than  either 
of  its  constituents.  It  is  renamed  "Divisors"  by  the  user. 
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*«  Ti«k  145  «*  Fill  In  axamplts  of  Divitort,  because  none  exist  yet,  and  Divisors  is  related  to 
the  very  interesting  concept  TIMES,  and  divisors  was  Just  created.  Many  examples  are 
found,  but  only  after  much  inefficient  searching  amid  the  set  of  all  (known  examples  of) 
numbers. 

*«  Twk  146  *»  Fill  In  now  •Ifcrilhmi  for  Divitort,  because  Divisors.Algs  are  all  too  slow,  and 
have  been  called  on  a  great  deal  recently.  AM  asks  the  user,  who  supplies  a  decent  iterative 
algorithm  for  this  function. 

«*  TttK  147  ««  Fill  in  oxtmplot  Of  Porfocl-tqutrot,  because  none  exist  yet,  and  Perfect- 
squares  is  related  to  the  very  interesting  concept  TIMES.  15  found,  after  which  the  space 
allocation  was  exhausted. 

one  Tttk  148  »«  Fill  in  tpocitliztiiont  of  TIMES,  because  TIMES  is  very  interesting,  has  very 
few  known  specializations,  and  it  was  very  easy  to  find  examples  of  TIMES.  AM  now 
allocates  a  huge  chunk  of  epu  time  and  space  to  this  task.  A  few  specializations  of  TIMES 
are  gotten  by  plugging  in  a  distinguished  value  for  one  argument:  TimesI(x)El«x, 
Times0(x)*0*x,  etc.  Other  new  operations  are  simply  TIMES  with  its  domain  restricted  to  a 
bag  of  special  numbers:  Times-sq  has  its  domain  a  bag  of  perfect  squares,  Times-ev  takes 
only  even  arguments,  etc.  Others  (inefficient  to  compute)  are  TIMES  with  its  range 
restricted:  Times-to-evens  requires  that  the  result  be  even,  Times-to-sq  for  square  results,  etc. 

Task  149  «*  Check  cxamplet  of  Divitort,  because  many  examples  have  recently  been 
found,  but  not  yet  checked,  and  Divisors  is  related  to  the  very  interesting  concept  TIMES. 
Often,  Divisors(x)  is  interesting  (to  AM)  as  a  set;  AM  isolates  the  cases  by  defining  0-Div,  1- 
Dlv,  2-Div,  and  S-Div,  the  seu  of  numbers  whose  Divisors  value  is  the  empty  set,  a 
singleton,  a  doubleion,  and  a  tripleton,  respectively.  AM  will  gradually  partition  the 
examples  of  Divisors  into  these  categories,  as  AM  tries  to  fill  in  examples  of  each  kind  of 
number. 

.  This  is  the  point  where  the  example  in  Chapter  2  begins,  and  is  also  roughly  the 
point  where  the  unadulterated  LWP  trace  (Appendix  SJ)  ends.  Both  this  section 
and  the  earlier  condensed  task-by-task  trace,  found  in  Chapter  6,  go  further. 

*«  T«sk  150  **  Fill  in  examples  of  1-Div,  because  none  exist  yet,  and  I-Div  was  Just  created, 
and  is  related  to  the  very  interesting  concept  TIMES.  Only  one  example  found:  "1".  This 
causes  the  Worth  of  1-Div  to  be  lowered. 

**  Task  151  **  Fill  in  examples  of  O-Div,  because  none  exist  yet,  and  0-Div  was  recently 
created,  and  is  related  to  the  very  interesting  concept  TIMES.  None  found.  Lower  the 
worth  of  this  concept. 

«*  Task  152  «*  Fill  In  examples  of  2*Div,  because  none  exist  yet,  and  this  concept  is  related  to 
the  very  interesting  concept  TIMES.  About  19  are  found  (out  of  about  170  attempts).  This 
is  a  nice  ratio,  a  nice  density  within  the  natural  numbers  -  not  too  many  nor  too  few  to  be 
interesting.  As  a  result,  2-Div.Worth  is  slightly  raised. 
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**  Titk  153  **  Check  txamplet  of  2*Div,  because  many  examples  have  recently  been  found, 
but  not  yet  checked,  and  the  worth  of  2-div  has  Just  increased,  and  Focus  of  Attention. 
The  existing  examples  were  confirmed,  but  no  patterni  noticed.  This  was  heart-stopping, 
since  2-Dio  is  the  notion  of  prime  numbers;  here  AM  is  tossing  it  off  as  r.on-interest-catchingl 

*♦  Twk  154  *♦  Fill  In  oximploi  of  3-Div,  because  none  exist  yet,  and  3-Div  is  related  to  the 
very  Interesting  concept  TIMES.  As  with  2-Div,  a  nice  number  of  examples  were  found 
(albeit  on  the  scarce  side  of  nice). 

Talk  155  Chock  oxamplot  of  3*0iv,  because  many  examples  have  recently  been  found, 
but  not  yet  checked,  and  3-Div  is  related  to  the  very  interesting  concept  TIMES.,  and  Focus 
of  Attention.  All  confirmed.  All  are  perfect  squares!  Very  unexpected  (both  by  AM  and 
the  user).  AM  greatly  Increased  the  worth  of  3-Dlv.  One  suggestion,  due  to  the  fact  that  3- 
Div  was  now  In-dom-of  Square-root,  was: 

»♦  Task  156  ♦♦  Rostrict  Squaro-root  to  numbora  which  art  in  3-Div,  Square-root  is  interesting, 
3-Div  is  very  interesting,  and  the  preceding  task  specifically  requested  this  action.  AM  calls 
the  new  concept  Root3. 

**  Task  157  «*  Fill  in  oxamplos  of  Roots,  because  none  exist  yet,  and  Root3  was  just  created, 
and  Roots  is  related  to  the  very  interesting  concept  3-Div.  Many  examples  found.  In  fact,  it 
was  easy  to  take  the  square-root  of  each  known  example  of  3-Div. 

**  Task  158  **  Chock  oxamplot  of  Roots,  because  many  examples  have  recently  been  found, 
but  not  yet  checked.  All  confirmed.  Each  result  turned  out  to  be  a  2-Div  type  of  number. 
Very  surprising.  Conjecture:  the  square-root  of  a  number  with  3  divisors  is  a  number  with 
2  divisors.  AM  raised  the  worths  of  all  the  concepts  involved.  At  this  point,  the  user 
renamed  2-Divs  as  "Primes". 

**  Task  159  ♦»  Rostrict  Squaring  to  Primoi,  Squaring  is  interesting.  Primes  is  recently 
interesting,  and  the  preceding  task  specifically  suggested  this  action.  AM  calls  the  result 
Square2. 

**  Tatk  160  **  Fill  in  oxamplot  of  Squaro2,  because  none  exist  yet,  and  Square2  was  just 
created.  Many  found. 

*«  Tatk  161  **  Chock  iho  domain/rtngo  facot  of  $quaro2,  it  has  been  specifically  suggested 
that  the  range  of  Square2  may  be  3-Div,  and  3-Div  is  very  interesting,  and  Square2  is 
related  to  the  interesting  concept  Primes,  and  Focus  of  Attention.  As  hoped  for,  all  are  3- 
Divs.  Conjecture:  x  is  a  Primes  iff  its  square  is  a  3-Div  iff  it  is  the  square-root  of  a  3-Div. 

♦♦  Tatk  162  ♦*  Rostrict  Squaring  to  3-Divt,  Squaring  is  interesting,  3-Div  is  interesting,  and 
an  earlier  task  specifically  suggested  this  action.  The  result  is  called  Square3.  AM’s  past 
few  successful  tasks  have  now  incremented  the  Worths  of  certain  activities  above  their  true 
value:  AM  will  now  be  tied  up  with  restricting  Stjuaring  and  Square-rooting  to  all  the 
concepts  involved.  The  net  effect  will  be  to  lower  those  inflated  worth  ratings,  and  to  lower  the 
user’s  —  and  the  reader’s  -  opinion  of  AM. 
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**  Tisk  163  **  Restrict  Squir«*rooting  to  Primes,  Square-root  is  interesting,  Prinies  is 
interesting,  and  an  earlier  task  specifically  requestd  this  action.  Call  the  result  Root2. 

**  Task  164  **  Fill  in  examples  of  Squares,  because  none  exist  yet,  and  SquareS  is  related  to 
the  interesting  concept  3-Div,  and  Squares  was  recently  created.  Only  9  examples  found 
before  running  out  of  space.  By  analogy,  since  Divi$or$-ofo$quare2  was  interesting,  AM 
considers: 

««  Task  165  Compose  Divisors-of  and  Squares,  Analogic  to  tripletons,  and  Divisors-of  is 
interesting,  and  Squares  is  interesting,  and  the  preceding  task  specifically  suggested  this 
action.  AM  calls  the  result  Div-SqS. 

««  Task  166  **  Fill  in  examples  of  OivSqS,  because  none  exist  yet,  and  Div-SqS  was  just 
created.  9  examples  found  right  away,  by  simply  running  Divisors-of.Algs  on  the  9  known 
examples  of  Squares. 

«»  Task  167  Check  examples  of  DivSqS,  because  many  examples  have  recently  been 
found,  but  not  yet  checked.  All  such  examples  are  Same-size.  Although  AM  doesn’t  have 
the  notion  of  ’^ness’  explicitly,  they  each  have  5  members.  A  specialized  hack  heuristic 
observes  the  general  pattern:  DivisorsoPrimes  are  all  of  the  same  size; 
DivisorsoSquaringoPrimes  are  ail  of  the  same  size;  DivisorsoSquaringoSquaringoPrimes  are 
all  of  the  same  size;  A  new  conjecture  is  formulated  and  typed  to  the  user: 
DivisorsoRepeat(Squaring)oPrimes  will  all  be  the  same  size. 

This  txpresses  the  fact  that,  for  a  given  n,  has  the  same  number  of  divisors  for 
each  prime  p.  AM  was  not  able  to  figure  out  that  number  of  divisors  (it  is  2n*J). 
This  would  be  a  trivial  sequence  extrapolation  problem,  but  AM  of  course  had  no 
heuristics  for  dealing  with  numbers,  hence  no  sequence  extrapolation  techniques. 
Deriving  the  concept  of  sequence  extrapolation  itself  would  have  been  quite 
astounding,  but  never  occurred.  Discovering  the  concept  of  inductive  inference  and 
studying  it  explicitly  in  isolation  is  quite  a  sophisticated  achievement  -  that’s  what 
A!  researchers  spend  much  of  their  time  trying  to  accomplish.  T his  is  one  time  when 
AM  was  much  further  from  discovering  a  theorem  than  it  appeared  to  the  casual 
observer. 

»»  Task  168  *«  ,  -  175.  More  confirmations  and  explorations  of  the  above  conjecture. 
Gradually,  all  its  ramifications  lead  to  dead-ends  (as  far  as  AM  is  concerned). 

»«  Task  176  Fill  in  axamplas  of  Root2,  because  none  exist  yet,  and  Root2  is  related  to  the 
interesting  concept  Primes.  But  no  examples  at  all  are  found.  This  is  not  surprising,  since 
very  few  primes  are  also  perfect  squares.  AM  conjectures  that  there  are  none.  Worth  of 
Root2  is  lowered. 

**  Task  177  »*  Ch<ick  axamplas  of  Inv-TIMES,  because  many  examples  have  recently  been 
fouiid,  but  not  yet  checked.  This  is  a  break  in  the  previous  smooth  line  of  development. 
Inv-TIMES  appears  to  always  contain  a  singleton  bag;  in  fact,  Inv-TIMES(x)  always 
contains  the  singleton  bag  (x).  Another  con^cture  AM  makes:  Inv-TIMES(x)  always 
contains  a  bag  of  primes.  This  last  hypothesis  suggests  the  following  two  tasks: 
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**  Titk  178  **  Restrict  the  range  of  Inv-TIMES  to  bags  of  primes,  because  Inv-TIMES  is 
interesting,  is  related  to  the  very  Interesting  concept  TIMES,  primes  are  more  Interesting 
tiian  numbers  at  the  moment,  Focus  of  Attention,  and  the  previous  task,  specifically 
suggested  this  action.  AM  calls  the  new  result  Prime-Times. 

♦♦  Task  179  *»  Rsstrict  th#  rang#  of  Inv-TIMES  to  ainglotons,  because  Inv-TIMES  is 
interesting,  is  related  to  the  very  interesting  concept  TIMES,  singletons  are  more  interesting 
than  sets  at  the  moment,  a  recent  task  specifically  suggested  this  action,  and  Focus  of 
Attention.  AM  calls  the  new  result  Single-Times. 

**  Task  180  »»  Fill  in  examples  of  Prime-times,  because  none  exist  yet,  and  Prime-times  was 
recently  defined,  and  Prime-times  is  related  to  the  interesting  concept  Primes,  and  Prime¬ 
times  is  related  to  the  interesting  concept  TIMES.  Many  examples  are  found. 

Task  181  acac  Chock  oxamplos  of  Prime-times,  because  many  examples  have  recently  been 
found,  but  not  yet  checked,  and  Focus  of  Attention,  and  Prime-times  is  related  to  the  very 
interesting  concept  TIMES.  The  value  of  Prime-times(x)  is  always  a  singleton  set. 
Conjecture:  Inv-TIMES(x)  contains  precisely  one  bag  of  primes.  User  renames  this 
conjecture  The  unique  factorization  theorem".  AM  prints  out  that  this  will  probably  be 
very  natural  and  important.  The  reason  for  this  is  that  Primes -was  itself  derived  from 
TIMES,  so  any  conjecture  connecting  them  is  quite  natural.  Any  unexpected  such  natural 
conjecture  will  probably  be  useful. 

*«  Task  182  *«  Fill  in  examples  of  Single-TIMES,  because  none  exist  yet,  and  this  concept  is 
related  to  the  very  interesting  concept  TIMES.  Many  found. 

**  Task  183  **  Cheek  examples  of  Single-TIMES,  because  many  examples  have  recently  been 
found,  but  not  yet  checked,  and  it  is  related  to  the  very  interesting  concept  TIMES,  and 
Focus  of  Attention.  The  value  of  Single-times(x)  is  always  a  singleton  set.  Conjecture:  Inv- 
TIMES(x)  contains  precisely  one  singleton  bag.  Single-TIMES  is  actually  the  same  as  Bag- 
insert,  in  the  sense  that  both  Single-TIMES{x)  and  Bag-insert(x)  give  the  value  (x)  —  the 
bag  containing  only  x.  In  the  latter  case,  this  is  because  Bag-insert  is  "smart"  enough  to 
supply  an  empty  bag  as  the  second  argument  S  to  Bag-insert(x,S),  if  S  is  missing. 

»»  Tisk  184  **  Fill  in  oxamplos  of  Solf-sof-union,  because  none  exist  yet.  AM  has  dropped  the 
momentum  of  its  previous  whirlwind  of  discovery,  and  is  simply  marking  time,  gathering 
evidence.  Many  examples  are  found. 

**  Task  185  at*  Chock  oxamplos  of  Solf-sot-union,  because  many  examples  have  recently  been 
found,  but  not  yet  checked,  and  Focus  of  Attention.  Apparently,  this  concept  is  the  same  as 
Identity  (but  with  domain/range  restricted  to  Sets).  Replace  by  a  conjecture. 

♦*  Task  186  **  Fill  in  oxamplos  of  Solf-bag-union,  because  none  exist  yet.  On  the  same  track, 
but  boring.  Many  found. 

**  Task  187  *«  Chock  oxamplos  of  Solf-bag-union,  because  many  examples  have  recently  been 
found,  but  not  yet  checked,  and  Focus  of  Attention.  All  are  confirmed.  Nothing  interesting 
noticed. 
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TatK  188  **  Fill  in  •xampiti  of  Inv-ADD,  because  none  exist  yet.  Slowly,  examples  were 
found. 

»«  TatK  189  *»  Chock  oxamplot  of  InvADO,  because  many  examples  have  recently  been 
found,  but  not  yet  checked,  and  Focus  of  Attention.  Inv-ADD(x)  always  contains  a 
singleton  bag  (x),  a  doubleton  bag,  a  tripleton,  a  bag  of  i’s,...  So  many  conjectures  that: 

««  To«k  1 90  »»  Rottrict  fh#  domain  of  InvADD,  because  Inv-Add  is  interesting,  related  to  the 
interesting  concept  Add,  Focus  of  Attention,  and  the  previous  task  specifically  suggested  this 
action.  When  the  domain  is  restricted  to  primes,  AM  defines  ‘Inv-Add-primes’.  When  it 
restricts  Inv-Add  to  work  only  on  evens,  AM  thereby  defines  the  operation  it  calls  'Inv-Add- 
evens’. 

**  Ta«k  191  Fill  in  oxampiet  of  Inv-add^primta,  because  none  exist  yet,  and  this  concept 
was  just  defined,  and  Focus  of  Attention.  Many  found. 

**  Task  192  ««  Chock  oxamploa  of  Inv-add-primos,  because  many  examples  have  recently  been 
found,  but  not  yet  checked,  and  Focus  of  Attention.  All  were  confirmed,  but  nothing  special 
noticed. 

**  Task  1 93  ««  Fill  in  oxamplos  of  invadd-ovons,  because  none  exist  yet,  and  this  concept  was 
recently  defined.  Many  examples  found. 

**  Task  194  ««  Chock  oxamplos  of  lnvsdd*ovons,  because  many  examples  have  recently  been 
found,  but  not  yet  checked,  and  Focus  of  Attention.  Confirmed.  Inv-Add-evens(x)  always 
contains  a  bag  of  primes.  This  is  mildly  surprising,  and  prompts:  ' 

**  Task  195  ««  Rostrlci  tho  range  of  irtv-Add-ovona  to  bags  of  primes,  because  Inv-Add-evens 
is  recently  interesting,  and  Primes  is  more  interesting  than  Numbers,  and  the  previous  task 
specifically  requested  this  action  (hence  Focus  of  Attention).  AM  names  the  new  operation 
Prime-ADD. 

««  Task  196  *«  Restrict  tho  range  of  InvADD  to  singletons,  because  Inv-Add  is  interesting, 
singletons  are  more  Interesting  than  sets,  AM  just  worked  on  Inv-Add,  AM  recently  worked 
on  Inv-Add,  and  an  earlier  task  specifically  suggested  this  action.  Thus  Single-Add  is  born. 

Task  197  »*  Fill  In  oxamplos  of  Prime-ADD,  because  none  exist  yet,  and  Prime-add  was 
recently  defined.  Many  found. 

**  Task  198  ««  Chock  oxamplos  of  Prime-ADD,  because  many  examples  have  recently  been 
found,  but  not  yet  checked,  and  Focus  of  Attention.  The  value  of  Prime-ADD(x>  is  always  a 
nonempty  set  (of  bags  of  primes).  So  conjectured  (domain/range  changed).  User  renames 
this  conjecture  "Goldbach’s  conjecture". 

Task  199  »$  rill  in  oxamplos  of  Singlo-APD,  because  none  exist  yet,  and  this  concept  was 
recently  defined.  Many  found. 
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**  Titk  200  *t  Cheek  examples  of  $in|le*ADD,  became  n'lany  examples  have  recently  been 
found,  but  not  yet  checked,  and  Focus  of  Attention.  The  value  of  SingleUDD(x)  is  always 
a  singleton  set.  Conjecture:  Inv-ADD(x)  contains  precisely  one  singleton  bag.  Single-ADD  is 
actually  the  same  as  Bag-m-sert  (and  Single-TIMES). 

*#  Task  201  *»  Restrict  the  range  of  Prime-ADD  to  singletons,  because  analogic  to  Prime- 
TIMES,  and  Prime-Add  is  interesting,  and  Singletons  is  more  interesting  than  Sets.  This 
was  initiated  by  analogy,  not  by  an  earlier  task  specifically  suggesting  that  the  restriction  be 
done.  In  this  case,  AM  is  asking  which  numbers  are  uniquely  representable  as  the  sum  of 
two  primes.  The  new  operation  is  Prime-ADD-s. 

««  Task  202  »»  Fill  in  axsmplas  of  Prima*AD0>s,  because  none  exist  yet,  and  Prime-ADD-s 
was  just  defined.  Many  examples  are  found,  but  after  a  nontrivial  processing  effort. 

**  Task  203  **  Chock  axamplas  of  Primo>A0D*s,  because  many  examples  have  recently  been 
found,  but  not  yet  checked,  and  Focus  of  Attention.  Nothing  special  noticed. 

»♦  Task  204  **  Fill  in  axamplas  of  Timas-sq,  because  none  exist  yet.  Losing  the  thread  of 
discovery,  moving  back  to  data-gathering  blindness.  Recall  that  Times-sq  is  just  TIMES 
restrict^  to  operate  on  perfect  squares.  Many  examples  found. 

»»  Task  20S  **  Chack  domain/ranga  of  Timas-iq,  because  the  range  of  this  operation  may 
actually  be  Perfect-squares,  and  examples  of  Times-sq  were  just  filled  in,  and  this  concept  is 
related  to  the  very  interesting  concept  TIMES,  and  Focus  of  Attention.  The  range  really 
does  seem  to  be  as  hoped  for.  Conjecture:  the  product  of  perfect  squares  is  a  perfect  square. 

**  Task  206  »*  Fill  in  axamplas  of  Timasl,  because  none  exist  yet.  Recall  that 
Times l(x)«TIMES(l,x).  Many  found. 

Task  207  ««  Chack  axamplas  of  Timasl,  because  many  examples  have  recently  been  found, 
but  not  yet  checked,  and  Focus  of  Attention.  Apparently  Timesl  is  just  a  restriction  of 
Identity.  Timesl  is  therefore  replaced  by  a  lone  conjecture:  Time$(x,l)-x. 

»»  Task  208  ««  Chack  axsmplas  of  Timas-sq,  because  many  examples  have  recently  been 
found,  but  not  yet  checked.  Confirmed. 

**  Task  209  ««  Fill  in  axamplas  of  TimasO,  because  none  exist  yet,  and  TimesO  is  related  to 
the  very  Interesting  concept  TIMES.  Many  found. 

ast  Task  210  **  Fill  in  axamplas  of  Tima82',  because  none  exist  yet,  and  this  operation  is 
related  to  the  very  interesting  concept  TIMES.  Many  found.  Recall  that  Times2(x)  is 
defined  as  2*x. 

Task  211  «»  Chack  axamplas  of  Timas2',  because  many  examples  have  recently  been 
found,  but  not  yet  checked,  and  Focus  of  Attention.  Apparently,  Times2’  is  the  same  as 
Doubling.  That  is,  x+x»2'x.  A  very  powerful  tie  between  Add  and  Times!  This  was 
highly  unexpected.  It  is  not  predicted  by  the  existing  analogy.  By  analogy,  AM  now  defines 
Ad2(x}  as  x«2,  and  will  invesitigate  that. 
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**  Task  212  **  Fill  in  •xttnpits  of  Ad2,  because  none  exist  yet,  and  Ad2  was  Jsst  created. 
Many  examples  found. 

»«  Task  213  **  Chock  oxampitt  of  Ad2,  because  many  examples  have  recently  been  found, 
but  not  yet  checked,  and  Focus  of  Attention.  Nothing  interesting  noticed.  AM  didn't  have 
the  notion  of  AddhAddI  at  that  moment,  or  it  could  have  derived  the  analogic  conjecture 
between  successor! addition  that  it  found  between  additionimultiplication.  The  same  lack  of 
knowledge  about  exponentiation  inhibited  the  perception  the  timeslexponent  analogic 
relationship.  Every  little  bit  of  knowledg  about  operations  involving  Add  served  to  raise  the 
worth  of  Add  slightly.  Finally,  the  following  task  rises  to  the  top: 

**  Task  214  Fill  in  spocializations  of  Add,  because  there  are  very  many  examples  of  Add, 
and  Add  has  recently  risen  in  interest.  Among  those  created  are:  AddO  (x+0).  Add],  Add3, 
Add-sq  (addition  restricted  to  perfect  squares),  Add-ev  (sum  of  even  numbers),  Add-pr  (sum 
of  primes),  etc.  The  techniques  used  were  the  same  ones  used  to  specialize  TIMES  earlier. 

**  Task  215  »«  Chock  oxamploi  of  TimoiO,  because  many  examples  have  recently  been  found, 
but  not  yet  checked.  The  value  always  seems  to  be  0.  So  conjKtured.  TimesO  goes  away. 

»»  Task  216  »«  Fill  in  oxamplos  of  Timot*ov,  because  none  exist  yet.  Recall  that  Times-ev  is 
just  like  TIMES  restricted  to  operating  on  even  numbers.  Many  examples  found. 

**  Task  217  »*  Chock  oxamplos  of  Timos'ov,  because  many  examples  have  recently  been 
found,  but  not  yet  checked,  and  Focus  of  Attention.  Apparently  all  the  results  are  all  Evens. 
Conjecture:  the  product  of  even  numbers  is  even.  Note  that  no  explicit  task  of  the  form 
"Check  DIR"  had  to  be  executed  in  order  to  notice  and  correct  this.  The  net  result  here  is  the 
same:  a  conjecture  and  an  alteration  of  the  Domainfrange  facet  of  Times-ev. 

**  Task  218  *«  Fill  in  oxamplos  of  Timos>t0'ov,  because  none  exist  yet.  That  is,  consider  bags 
of  numbers  which  multiply  to  give  an  even  number.  Many  found. 

**  Task  219  «*  Fill  In  oxamplos  of  TimosMo’sq,  because  none  exist  yet.  Find  bags  of  numbers 
whose  product  is  a  perfect  square.  A  few  found  right  away,  then  great  difficulty  in  locating, 
more  examples. 

»*  Task  220  **  Chock  oxamplos  of  Timos<'to'sq,  because  many  examples  have  recently  been 
found,  but  not  yet  checked,  and  Focus  of  Attention.  All  arguments  always  seem  to  be 
squares.  Conjee:  Tlmes-to-$q  is  really  the  same  as  Times-sq.  Merge  the  two.  This  is  a  false 
conjecture,  since,  e.g.,  the  product  of  the  numbers  in  the  bag  (2  2  S  is  a  perfect  square,  but 
none  of  those  numbers  is  itself  a  square.  This  did  AM  no  harm,  and  AM  never  detected  its 
mistake. 

**  Task  221  Chock  oxamplos  of  Timos-to'ov,  because  many  examples  have  recently  been 
found,  but  not  yet  checked.  The  domain  always  seems  to  contain  an  even  number.  So 
conjectured.  I.e.,  if  the  prodxut  of  some  numbers  is  even,  then  so  is  one  of  those  numbers.  This 
conjecture  is  true,  at  least. 

*«  Task  222  Fill  in  oxamplos  of  Solf'Union,  because  none  exist  yet.  Reaching  way  back  in 
time.  Vgh.  Many  found. 
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»«  T«tK  223  **  ChtcK  •xtmplas  Of  Stif-Union,  because  many  examples  have  recently  been 
found,  but  not  yet  checked,  and  Focus  of  Attention.  Nothing  special  noticed. 

*«  Task  224  **  Fill  in  oxamplot  of  SubSot,  because  none  exist  yet.  Many  found. 

**  Task  225  Chock  oxampio  of  SubSot,  because  many  examples  have  recently  been  found, 
but  not  yet  checked,  and  Focus  of  Attention.  Nothing  special  noticed. 

»«  Task  226  **  Fill  in  oxamplot  of  SuporSof,  because  none  exist  yet.  Many  found. 

**  Task  227  *«  Chock  oxamplos  of  SuporSot.,  because  many  examples  have  recently  been 
found,  but  not  yet  checked,  and  Focus  of  Attention.  AM  notices  that  if  <x,y>  are  related  by 
Subset,  then  Reverse-ord-pair(<x,y>)  ftre  related  by  Superset,  and  conversely.  This  is  the 
base  connection  between  union  and  intersection  (see  Tasks  29  and  39,  where  these  two 
concepts  are  defined).  That  is,  xcy  iff  ysx. 

**  Tofk  22?  **  Fill  in  oxamplot  of  CompoiooCompoto*!,  because  none  exist  yet.  AM  creates 
some  poor  combinations  (e.g.,  SquareoCountoADD*'},  some  explosive  ones  (e.g., 
(ComposeoCompose)o(Compose8Compose)o(ComposeoCompose)),  and  even  a  few  -  very  few 
-  winners  (e.g.,  SUBUCountoSelf-Insert).  This  is  too  much  like  throtiting  "flying,  hooked 
atoms"  up  into  the  air,  and  hoping  that  three  of  them  collide  fortuitously.  While  a  little 
guidance  may  help  you  to  find  go^  collisions  of  2  such  fliers,  the  combinatorial  explosion 
swamps  the  poor  researcher  when  he  takes  them  on  three  at  a  time.  As  St.  Augustine  observed, 
the  Latin  'cogito'  derives  from  'shake  together^,  but  'intelligo*  drives  from  'select  among'. 

**  Task  229  **  Chock  oxamplot  of  CompotooCompoto-l,  because  many  examples  have 
recently  been  found,  but  not  yet  checked,  and  Focus  of  Attention.  Nothing  interesting  to 
find. 

««  Task  230  ««  Fill  in  oxamplot  of  CompotooCompoto>2,  because  none  exist  yet.  Recall  that 
the  difference  between  this  operation  and  the  last  one  is  merely  in  the  order  of  the 
composing:  Fo(GoH)  versus  (FoG)oH.  AM  recreates  many  of  the  previous  tasks'  operations. 

**  Task  231  ««  Chock  oxamplos  of  CompoiooCompoto*2,  because  many  examples  have 
recently  been  found,  but  not  yet  checked,  and  Focus  of  Attention.  Nothing  noticed  yet. 
Later  on,  AM  finds  that  one  after  another  of  the  operations  created  in  the  preceding  task  as, 
say,  ComposeoCompose-I(F,GfI),  is  really  the  same  as  the  corresponding  operation  created  as 
Compose^ompose-2(F,Cfi).  Eventually,  AM  conjectures  that  those  two  ComposeoCompose 
operations  are  really  the  same;  that  is,  Compose  is  associative. 

**  Task  232  «* ,  •  252.  Fill  in  and  check  examples  of  the  losing  compositions  Just  created. 

**  Task  253  «*  Fill  in  oxamplos  of  Add*sq,  because  none  exist  yet,  and  Add-sq  is  related  to 
the  Interesting  concept  Add.  Recall  that  Add-iq  is  just  addition,  restricted  to  perfect 
squares.  Many  examples  found. 


Appendix  6 


AM  Oiecovery  in  Mtlheimliet  it  Heurittie  Search 


-316- 


»*  Tetk  254  **  Check  domain/ran|«  entries  of  Add>sq,  because  the  range  may  also  be  Perfect- 
squares,  and  examples  of  Add-sq  were  Just  filled  in  (hence  Focus  of  Attention),  and  Add-sq 
is  related  to  the  interesting  concept  Add.  The  range  isn’t  Perfect-squares  (e.g.,  4^9  is  not 
square),  but  some  values  are,  so  AM  defines  the  predicate  Add-sq-sq(x,y),  which  is  True  iff 
X  and  y  are  perfect  squares  and  their  sum  is  a  perfect  square  as  well  (e.g,,  Add-sq-sq(i6,9)). 

Add-sq-sq.Defn  Is  a  predicate  which  is  true  If  its  3  arguments  are  squares,  say  x^,  y2  ^2^ 
and  if  the  sum  of  the  first  two  is  equal  to  the  third:  x^+y^-z^. 

»»  Tick  255  «»  Fill  in  oxamples  of  Add-pr,  because  none  exist  yet,  and  Add-pr  is  related  to 
the  interesting  concept  Add.  That  is,  the  sum  of  a  pair  of  primes. 

**  Tack  256  «*  Chock  Domain/rango  ontrios  of  Add*pr,  because  many  examples  have  recently 
been  found,  but  not  yet  checked.  AM  defines  the  set  of  pairs  of  primes  whose  sum  is  also  a  . 
prime  (e.g.,  Add-pr-pr(2,5)).  In  a  rather  bizarre  way,  AM  has  defined  prime  pairs.  The  sum 
of  two  primes  can  be  a  prime  iff  one  of  them  is  2.  So  Add-pr-pr  can  really  be  considered  a 
predicate  on  one  prime  argument  x,  which  returns  True  iff  x*2  is  a  prime;  i.e.,  iff  x  is  the 
lower  member  of  a  prime  pair.  There  is  something  at  once  awful  and  sublime  about  this 
derivation  of  prime  pairs.  Perhaps  this  captures  the  spirit  of  AM^s  actions  as  a  whole,  so  let's 
stop  this  trace  right  here. 

Appendix  5.1  An  ^Unadulterated*  Trace 

Here  is  the  way  that  the  AM  program  begins.  The  human  user’s  typing  will  appear  in 
italics^.  He  first  types  (START)  to  start  the  system,  after  which  AM  asks  him  some 
questions.  Finally,  the  main  SelecticExecute-a-TASK  loop  of  AM  is  entered. 

The  careful  reader  will  notice  several  small  changes  in  this  transcript,  compared  to  the 
nicely  doctored  ones  which  preceded  it.  For  one  thing,  the  task  numbering  here  is  not 
precisely  the  same  as  in  the  rest  of  this  document.  A  task  is  called  a  "Cand",  and  the  agenda 
is  called  "CANDS".  Only  some  of  the  reasons  are  printed  out,  and  they  are  not  as  "chatty" 
as  the  reasons  in.  e.g..  Chapter  2’s  example  trace.  The  user  has  asked  AM  to  type  out  the 
top  three  tasks  on  the  agenda  at  each  "cycle".  In  a  better  hardware  environment,  the  user 
could  dynamically  watch  the  top  hundred  tasks  bubbling  around  on  one  side  of  a  CRT 
screen.  To  interrupt  AM,  the  user  types  CONTROL-I.  At  that  moment  he  has  a  very 
limited  syntax  of  questions  he  may  ask.  See  (ot)  below  (page  319). 

An  approximate  level  of  familiarity  of  the  user  with  the  AM  program  is  maintained  by 
AM,  as  a  numeric  variable.  Initially,  its  value  is  determined  by  the  number  of  times  the 
human  user  has  used  AM  in  the  past.'°  It  gradually  changes  in  value  as  a  single  session 
proceeds.  Many  print  statements  use  this  variable  to  determine  the  necessary  level  of  detail 
to  type.  For  example,  contrast  the  line  pointed  to  by  an  arrow  labelled  (fi)  below  with  the 
line  labelled  (<).  In  between,  the  variable  increased  to  the  point  where  a  detailed  message 


A 

Thi*  i«  not  •  doclorir)|;  I  htvt  writttn  in  i/o  routini  for  AM  which  print*  24  btfor*  ovirythinc  1h*  u**r  iyp**,  and  2t 
•ftirwird*.  Th*  'PUB'  docum«ntition  profrim  intirprat*  thi*  to  main  ”*witch  to  font  4”  and  "awitch  back 
to  font  I*.  Thi*  document  wa*  PUBBad  with  font  4  dafinad  a*  italic*.  , 

I  ahall  raaiat  th*  temptation  to  call  thi*  a  aimpi*  *u**r  modal*,  even  in  a  footnot*. 


■£>  KT^.  - 
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was  thought  to  be  superfluous.  The  level  of  detail  needed  for  clarity  should  not  be  confused 
with  the  level  of  verbosity  of  output  that  is  desired.  Should  the  user  see  every  function  call, 
or  the  results  of  each  task,  or  just  monitor  the  overall  character  of  what  AM  is  doing?  A 
separate  variable  is  maintained  for  this  verbosity  indication.  Finally,  another  partially 
independent  dimension  is  the  amount  of  control  the  user  wishes  to  have  on  AM.  Must  he 
approve  each  move  AM  makes,  occasionally  redirect  AM,  be  able  to  ask  questions?  Another 
variable  indicates  the  level  of  user  control  in  effect  at  the  moment.  The  first  several 
questions  deal  with  setting  up  these  parameters. 


INTERLISP-10  4-JAN-75 ... 

Good  afternoon,  Doug. 

HSTART) 

Please  type  in  your  last  name  ( then  carriage-return):F£/G£A?fiAC/M 
Almost  ready  to  enter  AM’s  main  loop,  Ed.  ' 

Ed,  you  have  used  AM  once  before. 

Do  you  feel  like  you  need  some  refreshing  about  how  to  work  with  it?  Yts 

AM  has  113  concepts  to  start  with,  each  with  only 
about  5  of  its  potential  30  facets  (  parts)  filled  In. 

Repeatedly,  AM  selects  a  part  of  a  concept, 
and  tries  to  fill  it  in  or  check  it.  In  this  process,  new 
concepts  may  emerge  and  be  granted  full  statust  in  those  cases, 
almost  ail  their  parts  will  be  empty  at  the  time  of  their  creation. 

CANOS  is  a  list  of  suggested  future  activities  for  AM. 

Repeatedly,  AM  picks  a  Candidate  from  CANDS  and  does  what  it  says. 

A  typical  Cand  might  be: 

(  Fill  in  examples  of  interesting  compositions). 

Each  Cand  also  has  a  list  of  reasons  explaining  why  it  was 
proposed,  and  a  numeric  rating  of  its  ovarall  value. 

DO-THRESHhold  is  a  numeric  variable  that  indicates  the  lowest 
rating  a  Cand  may  have  and  still  be  executed  by  AM. 

If  no  Cand  on  CANDS  measures  up,  then  all  the  concepts  try  to  suggest 
new  candidates,  which  are  merged  into  CANDS. 

DISCLAIMER:  The  user  interface  is  still  unfinished,  and  in  fact 
the  only  polished  part  is  the  following  message  describing  it!  Try  it 
at  your  own  risk! 


You,  Ed,  can  affect  this  concept  growing  process. 

At  any  time,  you  may  hit  tl,  which  will  Interrupt  me. 

Once  interrupted,  I  will  answer  one  question  or  perform  one  task; 
a  typical  question  I  can  answer  is  WHY; 

a  typical  task  is  (  Raise  the  Interest  Level  of  the  Frobnate  Concept). 
A  second  way  to  interact  with  me  is  to  help  me  decide 
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which  Cand  to  do  next  each  time.  You  can  see  my  top  choices,  their 
reasons,  and  overrule  me  if  you  want.  The  variable  Seencands  is 
the  number  of  Candidates  you  seo  each  time,  and  Ucontrol  indicates 
the  amount  of  control  you  have  over  my  choosing. 

To  Keep  you  informed,  I  will  periodically  print  out  messages. 

The  level  of  verbosity  can  be  changed  by  interrupting  me. 

More  details  can  be  obtained  when  you  interrupt  with  tl. 

Verbosity  Level  (1-10,  or  ?) ....  (  Please  terminate  your  response 
with  a  carriage-return) ...? 

Verbosity  -  the  amount  of  data  that  AM  spews  out  to  Ed. 

In  this  system.  Verbosity  is  just  a  numeric-valued  variable 
which  the  Printing  functions  look  at. 

Verbosity  level  0  suppresses  all  messages. 

Verbosity  level  5  lets  most  important  messages  get  printed. 
Verbosity  level  10  dumps  out  enough  to  actually  get  some 
feeling  for  the  inner  workings  of  AM. 

The  suggested  value  for  Ed  is  8. 

Verbosity  Level  (1-10,  or  ?) ....  (  Please  terminate  your  response 
with  a  carriage-return)  ,J0 

User-control  Level  (0-10,  or  ?  ) ....? 

User-control  «  the  degree  to  which  you  supervise  AM’s  activities 
In  this  system.  User-control  is  just  a  numeric-valued  variable 
which  the  top-level  control  functions  look  at. 

User-control  level  0  lets  you  gain  control  only  via  tl. 
User-control  level  5  lets  you  see  a  few  of  Afi^s  alternatives, 
and  waits  6  seconds  for  you  to  OK  its  choice. 

User-control  level  10  displays  several  alternative  Candidates, 
and  waits  indefinitely  until  you  to  select  one. 

User-control  Level  (0-10,  or  ?  )  ....4 

Before  deciding  which  new  Cand  to  work  on,  I’ll  print  my  top  choices. 

How  many  Candidates  would  you  like  to  see  each  time? 

( I  suggest  4):  S 

Should  I  tell  you  my  reasons  for  each  Cand  I  am  considering  selecting 
each  time?  No 

Should  I  tell  you  my  reasons  for  the  Cand  I  actually  select  each  time? 
Yes 

If  you  have  not  typed  anything  within  4  seconds  after  a  prompt, 
then  AM  will  fill  in  a  default  answer  for  you. 

A  space  will  suffice  to  keep  AM  from  defaulting  on  you, 
while  you  think  about  what  to  reply  to  any  question  AM  asks  you. 
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In  general,  your  response  should  be  terminated  by  a  carriage  return. 
Would  you  like  to  reset  this  waiting  time?  No 
Entering  AM's  main  loop  now. 


No  Cand  on  CANDS  is  good  enuf. 

Do-thresh  reduced  from  500  to  333 

Must  find  new  candidates  and  merge  them  into  CANOS. 


The  top  3  Cands  are: 

1:  Fill  in  some  examples  of  Set-struc-intersect 
2:  Fill  in  some  examples  of  Set-struc-join 
3:  Fill  in  some  examples  of  Coalesce 

I  choose  first  Cand.  OK?  yes. 

The  reason  for  considering  this  Cand  is:  (We  have  no  examples 
for  SET-STRUC-INTERSECT  yet) 


Beginning  1st  cycle. 


Failed.  Tried  to  fill  in  new  examples  of  SET-STRUC-INTERSECT. 

<At  this  moment,  the  user  hit  control-1  and  interrupted  AM.> 

?:  (W,  I,  E,  M,  N,  ?,  Q)  f 


Here  are  more  detailed  explanations  of  your  options: 

W  Why:  AM  gives  Ed  the  explanation  behind  its  last  printed 
message. 

I  Interest:  Ed  can  modify  the  interest  ratings  of  concepts  and 
Candidates. 

E  Evaluate:  Ed  types  in  an  expression  and  AM  runs  EVAL  on  it. 

M  Message:  What  was  the  last  message  that  AM  did  NOT  type  out 

because  the  verbosity  was  too  low? 

N  Name:  Rename  some  concept  to  whatever  you  want  to  call  it. 

C  Quit:  resume  execution. 

In  general,  AM  will  automatically  resume  execution  after  answering  one 
query.  You  must  hit  tl  again  to  interrupt. 


?:  W 

Why:  (No  examples  of  SET-STRUC-INTERSECT  were  found;  there 
is  no  reason  to  even  consider  specializing  it  further) 


This  Cand  used  11.159  cpu  seconds. 
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The  top  3  Cands  are: 

1:  Fill  in  some  examples  of  Set-struc 
2:  Fill  in  some  examples  of  Coalesce 
3:  Fill  In  some  examples  of  Nonempty^struc 

I  choose  first  Cand.  OK?  ^ti. 

The  2  reasons  for  considering  this  Cand  are: 

(Active-exs  specifically  asked  for  some  examples  of  SE'''-STRUC 
,  while  trying  to  Fill  in  some  Set-struc-intersect  examples) 

(We  have  no  examples  for  SET-STRUC  yet) 


Beginning  2nd  cycle. 


Creating  new  Being,  similar  to  SET-STRUC,  named  INT-SET-STRUC,  but 
restricted  so  as  to  make  it  more  interesting. 

An  INT-SET-STRUC  is  any  SET-STRUC  for  which  (Each  pair  of 
elements  satisfies  the  same  interesting  predicate  P  (for  some  P)). 

Filled  in  examples  of  SET-STRUC. 

0  examples  existed  originally  on  SET-STRUC. 

1 1  potential  new  entries  were  just  proposed. 

Eliminating  duplicates,  the  newly  constructed  examples  are; 

(CLASS) 

(CLASS  DOUG  CORDELL  BRUCE) 

(CLASS  RO-7  Rl-7  R2-7  R3-7  R4-7  R5-7  R6-7  R7-7) 

(CLASS  A) 

(CLASS  B) 

(CLASS  A  B) 

(CLASS  0  0  F  I  M) 

After  eliminating  duplicate  and  already-known  entries,  AM  finds  that, 
only  7  new,  distinct  examples  of  SET-STRUC  had  to  be  added. 


Do-thresh  raised  from  332  to  346  because  this  last  Cand  succeeded,  so 
we  raise  our  hopes—  and  our  st'  ndards—  temporarily. 


This  Cand  used  23.743  cpu  seconds. 


The  top  3  Cands  are: 

1:  Fill  in  some  examples  of  Int-set-struc 
2:  Fill  in  some  examples  of  Coalesce 
3:  Check  all  examples  of  Set-struc 

I  choose  first  Cand.  OK?  yes. 

The  reason  for  considering  this  Cand  is:  (Any  example  of 
INT-SET-STRUC  is  automatically  an  interesting  example  of  SET-STRUC) 
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Eieginning  3rd  cycle. 

Won’t  try  to  create  a  restricted  interesting  version  of  INT-SET-STRUC. 

Filled  in  examples  of  INT-SET-STRUC. 

0  examples  existed  originally  on  INI  "SET-STRUC. 

13  potential  new  entries  were  jjst  proposed. 

Eliminating  duplicates,  the  newly  constructed  examples  are: 

(CLASS) 

(CLASS  A) 

(CLASS  B) 

After  eliminating  duplicate  and  already-known  entries,  AM  finds  that, 
only  3  new,  distinct  examples  of  INT-SET-STRUC  had  to  be  added. 


Do-thresh  raised  from  346  to  358. 
This  Cand  used  11.881  cpu  seconds. 


The  top  3  Cands  are: 

1:  Fill  in  some  examples  of  Obj-equal 
2:  Check  all  examples  of  Int-set-struc 
3:  Check  all  examples  of  Set-struc 

I  choose  first  Cand.  OK? 

The  reason  for  considering  this  Cand  is:  (We  have  no  examples 
for  OBJ-EQUAL  yet) 


Beginning  4th  cycle. 

Record  of  attempts  to  find  examples: - 

An  ex  (  sought)  is:  ((CLASS  A),(CLASS  A)  -*  T)  + - - - 

— - - - - - - 

- + - + - + — 

Found  6  examples  (  and  151  non-exs),  in  11.644  secs. 

Ratio  of  exs  to  non-exs  is  too  low  (  6  /  151);  Exs  are  too  sparse. 

AM  will  sometime  try  to  generalize  OBJ-EQUAL. 

Won’t  try  to  create  a  restricted  interesting  version  of  OBJ-EQUAL. 

Filled  in  examples  of  OBJ-EQUAL. 

0  examples  existed  originally  on  OBJ-EQUAL. 

6  potential  new  entries  were  just  proposed. 

Eliminating  duplicates,  the  newly  constructed  examples  are: 
((CLASS  A)  (CLASS  A)  -»  T) 

((CLASS  0  D  F  I  M)  (CLASS  0  D  F  I  M)  -♦  T) 

(FALSE  FALSE  -♦  T) 
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After  eliminating  duplicate  and  already-known  entries,  AM  finds  that, 
only  3  new,  distinct  examples  of  OBJ-EQUAL  had  to  be  added. 


DO'thresh  raised  from  358  to  359. 
This  Cand  used  17.886  cpu  seconds. 


No  Cand  on  CANDS  is  good  enuf. 

Do-thresh  reduced  from  359  to  239 

Must  find  new  candidates  and  merge  them  into  CANOS. 


The  top  3  Cands  are: 

1:  Fill  in  some  examples  of  Set-struc-intersect 
2:  Check  all  examples  of  Int-set-struc 
3:  Fill  in  some  generalizations  of  Obj-equal 


I  choose  first  Cand.  OK?  yes. 

The  reason  for  considering  this  Cand  is:  <We  have  no  examples 
for  SET-STRUC-INTERSECT  yet) 

AM  recently  tried  this  same  Cand,  so  let's  skip  it  now. 


The  top  3  Cands  are: 

1:  Check  all  examples  of  Int-set-struc 
2:  Fill  in  some  generalizations  of  Obj-equal 
3:  Check  all  examples  of  Set-struc 

I  choose  first  Cand.  OK?  yes. 

The  reason  for  considering  this  Cand  is:  (Some  new  ,  unchecked 
examples  of  INT-SET-STRUC  have  recently  been  added) 


Beginning  5th  cycle. 

AM  is  forgetting  the  entire  SUGG  facet  of  the  INT-SET-STRUC  concept. 
Because:  (No  sense  using  this  suggestion  more  than  once). 

Checked  examples  of  INT-SET-STRUC  and  all  entries  were  confirmed 

This  Cand  used  11.362  cpu  seconds. 


The  top  3  Cands  are: 
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1:  Check  all  examples  of  Set-struc 

2:  Fill  in  some  generalizations  of  Obj-equal 

3:  Fill  in  some  examples  of  Coalesce 

I  choose  first  Cand.  OK?  yes. 

The  reason  for  considering  this  Cand  is:  (Some  new  ,  unchecked 
examples  of  SET-STRLK!  have  recently  been  added) 


Beginning  6th  cycle. 


Based  on  empirical  experiments,  AM  believes  that  SET-STRIX  may  really 
be  no  more  specialized  than  UNORD-OBJ. 

Closer  inspection  reveals  that  the  evidence  for  this  was  quite  flimsy. 

AM  will  wait  until  some  examples  of  any  of  these  have  been  found:  ( 
BAG-STRUC),  and  then  see  if  they  truly  also  are  SET-STRUC’s. 


Based  on  empirical  experiments,  AM  believes  that  SET-STRUC  may  really 
be  no  more  specialized  than  NONMULT-STRUC. 

Closer  inspection  reveals  that  the  evidence  for  this  was  quite  flimsy. 

AM  will  wait  until  some  examples  of  any  of  these  have  been  found:  ( 
OSET-STRUC),  and  then  see  if  they  truly  also  are  SET-STRX’s. 

Checked  examples  of  SET-STRX. 

5  entries  were  there  initially. 

1  small  modifications  had  to  be  made. 

5  entries  are  present  now. 


This  Cand  used  8.008  cpu  seconds. 


The  top  3  Cands  are: 

1:  Fill  in  some  examples  of  Bag-struc 
2:  Fill  in  some  examples  of  Oset-struc 
3:  Fill  in  some  generalizations  of  Obj-equal 

I  choose  first  Cand.  OK?  yes. 

The  reason  for  considering  this  Cand  is:  (We  have  no  examples 
for  BAG-STRX  yet) 


Beginning  7th  cycle. 


Filled  in  examples  of  BAG-STRX. 


Appendix  5 


AM  Oiteovtry  in  Mithenutict  n  Heurittic  Seirch 


0  examples  existed  originally  on  BAG-STRUC. 

19  potential  new  entries  were  just  proposed. 

Eliminating  duplicates,  the  newly  constructed  examples  are: 

(BAG) 

(BAG  A) 

(BAG  B) 

(BAG  A  B) 

(BAG  A  A) 

(BAG  A  A  B) 

(BAG  0  D  F  I  M) 

(BAG  A  B  (BAG  B)  (CLASS)) 

(BAG  BRUCE  CORDELL  DOUG) 

(BAG  RO-7  Rl-7  R2-7  R3-7  R4-7  R5-7  Ru-/  R7-7) 

After  eliminating  duplicate  and  already-known  entries,  AM  finds  that, 
only  10  new,  distinct  examples  of  BAG-STRUC  had  to  be  added. 


XEQ-CAND 

Do-thresh  raised  fr  n  239  to  264. 
This  Cand  used  17.692  epu  seconds. 


The  top  3  Cands  are: 

1:  Fill  in  some  generalizations  of  Obj-equal 
2:  Fill  in  some  examples  of  Oset-struc 
3:  Fill  in  some  examples  of  Coalesce 

I  choose  first  Cand.  OK?  yes. 

The  reason  is:  (The  ratio  of  examples  to  non-examples  of 
OBJ-EQUAL  is  too  low  i  0BJ-E(}UAL  is  too  specialized  ,  too  narrow) 


Beginning  8th  cycle. 

Considering  genlizing  a  recursive  defn  of  OBJ-EQUAL 
Will  try  to  remove  a  conjunct. 

2  possible  conjuncts  to  choose  from. 

AM  generalizes  OBJ-EQUAL  into  the  new  concept  GENL-OBJ-EQUAL,  by 
not  recursing  on  the  CAR  of  each  arg. 
i.e.,  GENL-OBJ-EQUAL  will  not  have  a  recursive  check 
like  this  one,  which  is  present  in  06J-E(}UAL: 

APPLYB 

(QUOTE  OBJ-EQUAL) 

(QUOTE  DEFN) 

(CAR  BAD 
(CAR  BA2) 

AM  generalizes  OBJ-EQUAL  into  the  new  concept  GENL-OBJ-EQUAL- 1, 
by  not  recursing  on  the  CDR  of  each  arg. 
i.e.,  GENL-0BJ-E(}UAL-1  will  not  have  a  recursive  check 
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like  this  one,  which  is  present  in  OBJ-EQUAL: 

APPLYB 

(QUOTE  OBJ-EQUAL) 

(QUOTE  DEFN) 

(CDR  BAD 
(CDR  BA2) 

If  any  of  (GENL-OBJ-EQUAL  GENL-OBJ-EQUAL-D  ever  seems  to  be  too 
specialized,  AM  will  consider  disjoining  it  with  other  members  of  that 
set. 

Filled  in  generalizations  of  OBJ-EQUAL. 

0  generalizations  existed  originally  on  OBJ-EQUAL. 

2  potential  new  entries  were  just  proposed. 

Eliminating  duplicates,  the  newly  constructed  generalizations  are: 
GENL-OBJ-EQUAL 
GENL-OBJ-EQUAL-1 

After  eliminating  duplicate  and  already-known  entries,  AM  finds  that, 
all  2  new,  distinct  generalizations  of  OBJ-EQUAL  had  to  be  added. 


Do-thresh  raised  from  264  to  335. 
This  Cand  used  6.667  cpu  seconds. 


The  top  3  Cands  are: 

1:  Fill  in  some  examples  of  (jenl-obj-equal-1 
2;  Fill  in  some  examples  of  Genl-obj-equal 
3:  Fill  in  some  examplls  of  Oset-struc 

I  choose  first  Cand.  OK?  yes. 

The  reason  is:  (The  generalization  GENL-OBJ-EQUAL-1  of  OBJ-EQUAL 
is  relatively  new  and  has  no  exs  of  Its  own  yet ,  excepting  those 
of  OBJ-EQUAL) 


Beginning  9th  cycle. 


?:N 

Rename  which  existing  concept?  GENL'OBJ-EQIJAL 
What  is  its  new  name?  SAME-SIZE 
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Record  of  attempts  to  find  examples: 

An  ex  (  sought)  is:  ((VECTOR  BAG)  (VECTOR  B  (BAG  B)  (CLASS)  A))+ - ++ 

-+ — + - ++-+—++-+ - ++ - +— + - + - — 

- ++--+ - ++ - +-+ - + 


Found  26  examples  (  and  IGJ  non-exs),  in  8.037  secs. 

A  nice  ratio  of  exs/non-exs  was  encountered  for  GENL'OBJ-EQUAL-l 
Won't  try  to  create  a  restricted  interesting  version  of 
GENL-OBJ-EQUAL-1. 


Filled  in  examples  of  GENL-OBJ-EQUAL-1. 

0  examples  existed  originally  on  GENL-OBJ-EQUAL-1. 
26  potential  new  entries  were  just  proposed. 


Eliminating  duplicates,  the  newly  constructed  examples  are: 

((VECTOR  BAG)  (VECTOR  B  (BAG  B)  (CLASS)  A)  -♦  T) 

((OSET  0  D  F  I  M)  (OSET  0  0  F  I  M)  -♦  T) 

((BAG)  (BAG  DON  ED)  -♦  T) 

((OSET  D  M  I  F  0)  (OSET  D  M  I  F  O)  T) 

((PAIR  DOUG  BRUCE)  (PAIR  DOUG  BRUCE)  -*  T) 

((VECTOR  BAG)  (VECTOR  D  M  I  F  0)  -*  T) 

((VECTOR  B)  (VECTOR  D  M  I  F  0)  -4  T) 

((BAG  B)  (BAG  B)  -♦  T) 

((VECTOR  D  M  I  F  0)  (VECTOR  A  A  B)  t  T) 

((BAG  A)  (BAG  A  B)  T) 

((VECTOR)  (VECTOR  B  (BAG  B)  (CLASS)  A)  T) 

((OSET  BRUCE  DON)  (OSET  B  A)  T) 

((PAIR  COMPOSE-EXS  COMPOSE-EXS)  (PAIR  LIST-STRUC-INTERSECT 
ANYB-SPEC)  -» T) 

((OSET  R2-1  R2-2  R2-3  R2-4  R2'5  R2-6  R3-1  R3-2  R3-3  R3-4  R3-B 
R3-6  R4-1  R4-2  R4-3  R4-4  R4-5  R4-6  R5-i  R5-2  R5'3  R5-4  n5-5  R5-6  R6-1 
R6-2  R6-3  R6-4  R6-5  R6-6)  (OSET  0  D  F  I  M)  T) 

((OSET  A  B  (BAG  B)  (CLASS))  (OSET  3  (SAG  B)  (CLASS)  A)  T) 

((OSET  0  D  F  I  M)  (OSET  B)  -♦  T) 

((VECTOR  A  A)  (VECTOR  A  B)  -♦  T) 

((OSET  DON  ED)  (OSET  BAG)  ->  T) 

((BAG  A  A  B)  (BAG)  4  T) 

((OSET  B)  (OSET  BRUCE  DON)  -♦  T) 

•  ((CLASS  DON  ED)  (CLASS  A)  4  T) 

((PAIR  LIST-STRUC-INSFRT  CANONIZE)  (PAIR  LIST-STRUC-INTERSECT 
ANYB-SPEC)  4  T) 

((VECTOR)  (VECTOR  BAG)  4  T) 

((OSET  A)  (OSET  D  M  I F  0)  4  T) 

((VECTOR  BAG)  (VECTOR  BAG)  4  T) 

After  eliminating  duplicate  and  already-Known  entries,  AM  finds  that, 
only  25  new,  distinct  examples  of  GENL-(3BJ-EQUAL-i  had  to  be  tdded. 


Do-thresh  raised  from  335  to  367, 
This  Cand  used  29.095  cpu  seconds. 
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The  top  3  Canris  are: 

i:  Fil!  in  some  examples  cf  Same-size 
2:  Check  all  exsmples  ot  Genl-obj-equal-i 
3:  Fill  in  some  examples  of  Coalesce 

I  choose  first  Cand.  OK?  yes. 

The  2  reasons  are: 

(Interestingness  of  SAME-SIZE  has  changed  recently) 

<The  generalization  SAME-SIZE  Of  OBJ-EQUAL  Is  relatively  new 
and  has  no  exs  of  its  own  yet  ,  excepting  those  of  OBJ-EQUAL) 


Beginning  10th  cycle. 

Record  of  attempts  to  find  examples; 

An  ex  {  sought)  is:  ((VECTOR  A)  (OSET  B))+ — +— + - + - ++++ - 

- +-+ — +-+ - + - +-+-+—-+-+ - -+-+— - — + — +•— 

- + — + - + 

Found  26  examples  (  and  102  non-exs),  in  8.032  secs. 

A  nice  ratio  of  exs/non-exs  was  encountered  for  SAME-SIZE 
Won't  try  to  create  a  restricted  interesting  version  of  SAME-SIZE. 

Filled  in  examples  of  SAME-SIZE. 

0  examples  existed  originally  on  SAME-SIZE. 

36  potential  new  entries  were  just  proposed. 

Eliminating  duplicates,  the  newly  constructed  examples  are: 

((OSET  0  D  F  I  M)  (OSET  0  D  F  I  M)  -♦  T) 

((OSET  0  M  I  F  0)  (OSET  D  M  I  F  0)  T) 

((PAIR  DOUG  BRUCE)  (PAIR  DOUG  BRUCE)  -♦  T) 

((BAG  B)  (BAG  B)  T) 

((OSET  BRUCE  DON)  (OSET  B  A)  -♦  T) 

((PAIR  COMPOSE-EXS  COMPOSE-EXS)  (PAIR  LIST-STRUC-INTERSECT 
ANYB-SPEC)  -♦  T) 

((OSET  A  B  (BAG  B)  (CLASS))  (OSET  B  (BAG  B)  (CLASS)  A)  T) 
((VECTOR  A  A)  (VECTOR  A  B)  ->  T) 

((PAIR  LIST-STRUC-INSERT  CANONIZE)  (PAIR  LIST-STRUC-INTERSECT 
ANYB-SPEC)  -♦  T) 

((VECTOR  BAG)  (VECTOR  BAG)  T) 

((VECTOR  A)  (OSET  B)  -♦  T) 

((BAG  A  B)  (OSET  B  A)  -»  T) 

((CLASS  0  0  F  I  M)  (BAG  0  D  F  I  M)  ->  T) 

((VECTOR  B>  (BAG  A)  -♦  T) 

((PAIR  LIST-STRUC-INTERSECT  ANYB-SPEC)  (PAIR  DOUG  BRUCE)  T) 
((OSET  DON  ED)  (PAIR  LIST-STRUC-INTERSECT  ANYB-SPEC)  -*  T) 
((BAG  0  D  F  I  M)  (VECTOR  D  M  I  F  0)  -♦  T) 

((VECTOR  B)  (BAG  B)  ->  T) 

((OSET  BAG)  (OSET  A)  -♦  T) 

((VECTOR  A  A)  (BAG  A  A)  -»  T) 

((CLASS  A)  (VECTOR  BAG)  -»  T) 

((CLASS  A  B)  (OSET  A  B)  -♦  T) 

((PAIR  COMPOSE-EXS  COMPOSE-EXS)  (OSET  DON  ED)  T) 
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((VECTOR  A)  (OSET  A)  -*  T) 

((OSET  BAG)  (CLASS  A)  T) 

((OSET  A)  (CLASS  A)  -+  T) 

((OSET  B>  (OSET  A)  T) 

((BAG  0  D  F  I  M)  (OSET  0  D  F  I  M)  -»  T) 

((OSET  DON  ED)  (OSET  ED  CORDELL)  ->  T) 

((OSET  ED  CORDELL)  (OSET  B  A)  T) 

((OSET  A)  (BAG  B)  -♦  T) 

((OSET  B  A)  (OSET  A  B)  -♦  T) 

((VECTOR  B  A)  (OSET  ED  CORDELL)  ->  T) 

((OSET  A)  (VECTOR  BAG)  -♦  T) 

((OSET  B  A)  (OSET  DON  ED)  -♦  T)  ‘ 

After  eliminating  duplicate  and  already-known  entries,  AM  finds  that 
only  35  new,  distinct  examples  of  SAME-SIZE  had  to  be  added. 


Do-thresh  raised  from  367  to  406. 
This  Cand  used  21.725  epu  seconds. 


The  top  3  Cands  are; 

1:  Check  all  examples  of  Same-size 
2:  Check  all  examples  of  Geni-obj-equal-1 
3:  Check  all  things  which  just  barely  miss  being  examples  of 
Same-size 

I  choose  first  Cand.  OK?  yes. 

The  reason  is:  (Some  new  ,  unchecked  examples  of  SAME-SIZE 
have  recently  been  added) 


Beginning  list  cycle. 

Checked  examples  of  SAME-SIZE. 

35  entries  were  there  initially. 

1  had  to  be  completely  discarded. 

4  had  to  be  transferred  elsewhere. 
30  entries  are  present  now. 


Do-thresh  raised  from  406  to  421. 
This  Cand  used  6.917  epu  seconds. 


The  top  3  Cands  are; 

1:  Check  ail  examples  of  Genl-obj-equai-1 

2:  Check  all  things  which  just  barely  miss  being  examples  of 
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Same-size 

3:  Fill  in  some  examples  of  Coalesce 

I  choose  first  Cand.  OK?  yes. 

The  reason  is:  (Some  new  ,  unchecked  examples  of 
GENL-OBJ-EQUAL-1  have  recently  been  added) 


Beginning  12nd  cycle. 

Checked  examples  of  GENL-OBJ-EQUAL-1. 
25  entries  were  there  initially. 

1  had  to  be  completely  discarded. 

4  had  to  be  transferred  elsewhere. 
20  entries  are  present  now. 


This  Cand  used  4.711  cpu  seconds. 

No  Cand  on  CANOS  is  good  enuf. 

Do-thresh  reduced  from  421  to  333 

Must  find  new  candidates  and  merge  them  into  CANOS. 


The  top  3  Cands  are: 

1:  Canonize  these  2  arguments:  Genl-obj-equal-1  and  Obj-equal 
2:  Canonize  these  2  arguments;  Same-size  and  Obj-equal 
3:  Fill  in  some  examples  of  Coalesce 

I  choose  first  Cand.  OK?  yes. 

The  reason  is:  (It  would  be  nice  to  find  a  canonical  (  with 
respect  to  (jienl-obj-equal-1  and  Obj-equal )  representation  C  for  any 
Object  X  ;  that  is  , 

(  GENL-OBJ-EQUAL-1  x  y  )  iff 
(  OBJ-EQUAL  (  C  x  )  (  C  y  ) ) . 

) 


Beginning  13rd  cycle. 

Experiments  indicate  that  GENL-OBJ-EQUAL-1  is  affected  by  the  varying 
the  type  of  structure  of  its  arguments. 

GENL-OBJ-E()UAL-l  doesn't  look  at  any  elements  of  OBJECT  except  possibly 
the  car  of  the  structure  which  denotes  its  type,  so  AM  replaces  the 
tail  of  OBJECT  by  a  canonical  distinguished  tail,  say  NIL. 

Succeeded! 

Some  conjectures  that  AM  considers  believable: 


Apptndix  5  AM  Ditcovtry  in  Mtlhtmatiet  ••  Hnuristk  S«*rch 

OBJ-EQUAL,  restricted  to  canonical  OBJECT’S,  is  indistinguishable 
from  GENL-OBJ-EQUAL-1. 

There  is  a  powerful  analogy  between 

GENL-OBJ-EQUAL-1 _ _ OBJ-EQUAL 

OBJECT . CANONICAL-OBJECT 

operators  on  and  into  those  operators  restricted  to 

OBJECT. . .CANONICAL-OBJECT 

statements  involving  these.......statements  involving  these 


Do-thresh  raised  from  333  to  341. 
This  Cand  used  9.02  cpu  seconds. 


The  top  3  Cands  are: 

1:  Fill  in  some  examples  of  Canonical-object 

2:  Restrict  the  following:  Genl-obj-equal-1  Canonical-object  Domain 

3:  Canonize  these  2  arguments:  Same-size  and  Obj-equal 

I  choose  first  Cand.  OK?  yts. 

The  reason  is:  (Any  example  of  CANONICAL-OBJECT  is  a  canonical 
example  of  OBJECT) 


Beginning  14th  cycle. 


AM  will  now  try  to  produce  examples  of  CANONICAL-OBJECT  by  running  the 
following  operations: 

(CANONIZE-GENL-OBJ-EQUAL-l&OBJ-EQUAL). 

Won’t  try  to  create  a  restricted  interesting  version  of 
CANONICAL-OBJECT. 

Filled  in  examples  of  CANONICAL-OBJECT. 

0  examples  existed  originally  on  CANONICAL-OBJECT. 

165  potential  new  entries  were  just  proposed. 

Eliminating  duplicates,  the  newly  constructed  examples  are: 

(VECTOR) 

(BAG) 

(CLASS) 

(OSET) 

FALSE 

T 

•  TRUE 
(PAIR) 

(T) 
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(NIL) 

(TRUE) 

(FALSE) 


After  eliminating  duplicate  and  already-known  entries,  AM  finds  that, 
only  12  new,  distinct  examples  of  CANONICAL-OBJECT  had  to  be  added. 


Do-thresh  raised  from  341  to  391. 
This  Cand  used  23.827  cpu  seconds. 


The  top  3  Cands  are: 

1:  Restrict  the  following:  Genl-obj-equal-1  Canonical-object  Domain 
2:  Canonize  these  2  arguments:  Same-size  and  Obj-equal 
3:  Fill  in  examples  of  Coalesce 

I  choose  first  Cand.  OK?  yes. 

The  reason  is:  (GENL-OBJ-EQUAL-1  was  one  of  the  predicates 
which  defined  the  new  concept  CANONICAL-OBJECT  ,  so  it  is  worth 
considering  the  restriction  of  GENL-OBJ-EQUAL-l  to  that  subset  of 
OBJECT ’s) 


Beginning  15th  cycle. 


Succeeded! 

Do-thresh  raised  from  391  to  431. 
This  Cand  used  3.562  cpu  seconds. 


The  top  3  Cands  are: 

1:  Canonize  these  2  arguments:  Same-size  and  Obj-equal 

2:  Fill  in  some  examples  of  Coalesce 

3:  Restrict  the  following:  Obj-equal  Canonical-object  Domain 

I  choose  first  Cand.  OK?  yes. 

The  reason  is:  (It  would  be  nice  to  find  a  canonical  (  with 
respect  to  Same-size  and  Obj-equal )  representation  C  for  any  Object 
X  :  that  is  , 

(  SAME-SIZE  X  y  )  iff 
(  OBJ-EQUAL  (  C  X  )  (  C  y  ) ) . 

% 

) 
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Beginning  16th  cycle. 


Experiments  indicate  that  SAME>SIZE  is  not  affected  by  varying  the  type 
of  structure  of  its  arguments. 

Experiments  indicate  that  SAME-SIZE  is  not  affected  by  reordering 
elements  of  its  structural  arguments. 

So  any  canonical  arguments  can  be  Bags  and  Sets. 

Experiments  indicate  that  SAME-SIZE  is  affected  by  the  presence  of 
multiple  elements  in  its  structural  arguments. 

So  any  canonical  argumehts  can  be  Bags  a  id  Lists. 

SAME-SIZE  doesn’t  look  at  the  specific  elements  in  OBJECT,  like 
OBJ-EQUAL  does,  so  AM  can  replace  them  all  by  e  single  distinguished 
element,  say  T. 

Succeeded! 

Some  conjectures  that  AM  considers  believable: 

OBJ-EQUAL,  restricted  to  canonical  BAG-STRUC*$,  is  indistinguishable 
from  SAME-SIZE. 

There  is  a  powerful  analogy  between 

SAME-SIZE _ OBJ-EQUAL 

BAG-STRUC _ .CANONICAL-BAG-STRUC 

operators  on  and  into . those  operators  restricted  to 

BAG-STRUC. _  CANONICAL-BAG-STRUC 

statements  involving  these~»...statement$  involving  these 


Do-thresh  raised  from  431  to  457. 
This  Cand  used  17.297  cpu  seconds. 


The  top  3  Cands  are: 

1:  Fill  in  some  examples  of  Canonical-bag-struc 

2:  Restrict  the  following:  Same-size  Canonical-bag-struc  Domain 

3:  Restrict  the  following:  Bag-struc-join  Canonical-bag-struc  Domain 

I  choose  first  Cand.  OK?  yts. 

The  reason  is:  (Any  example  of  CANONICAL-BAG-STRUC  is  a  canonical 
example  of  BAG-STRUC) 


Beginning  17th  cycle. 
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AM  will  now  try  to  produce  examples  of  CANONICAL-BAG-STRUC  by  running 
the  following  operations: 

(CANONIZE-SAME-SIZE&OBJ-EQUAL). 

Filled  in  examples  of  CANONICAL-BAG-STRUC. 

0  examples  existed  originally  on  CANONICAL-BAG-STRUC. 

211  potential  new  entries  were  just  proposed. 

Eliminating  duplicates,  the  newly  constructed  examples  are: 

(BAG) 

(BAG  T  T) 

(BAG  T  T  T) 

(BAG  T) 

(BAG  T  T  T  T  T) 

(BAG  T  T  T  T) 

(BAG  TTTTTTTTTTTTTTTTTTTTTTTTTTTT 

TT) 

After  eliminating  duplicate  and  alreadyrknown  entries,  AM  finds  that, 
only  7  new,  distinct  examples  of  CANONICAL-BAG-STRUC  had  to  be  added. 


Do-thresh  raised  from  457  to  478. 
This  Cand  used  35.918  cpu  seconds. 


The  top  3  Cands  are: 

1:  Restrict  the  following:  Same-^ize  Canonical-bag-struc  Domain 
2:  Restrict  the  following:  Bag-struc-join  Canonical-bag-struc  Domain 
3:  Restrict  the  following:  ObJ-equal  Canonical-object  Domain 

I  choose  first  Cand.  OK?  •jti. 

The  reason  is:  (SAME-SIZE  was  one  of  the  predicates  which  defined 
the  new  concept  CAI^NICAL-BAG-STRUC ,  so  it  is  worth  considering 
the  restriction  of  SAME-SIZE  to  that  subset  of  BAG-STRUC ’s) 


Beginning  18th  cycle. 


Succeeded! 

Do-thresh  raised  from  478  to  495. 
This  Cand  used  3.311  cpu  seconds. 
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Rename  which  existing  concept?  CANON ICAL-BAG-STRUC 

What  is  Its  new  name?  NUMBER 

Done. 

?i  (W,  I,  E,  M,  N,  ?,  Q) 

Rename  which  existing  concept?  CANON IZE-SAME-SlZEfiOBJ-E^ AL 

What  is  its  new  name?  SIZE 

Done. 

The  top  3  Cands  are: 

1:  Check  all  examples  of  Number 

2;  Restrict  the  following:  Obj-equal  Csnonical<H)bjoct  Domain 
3:  Check  ail  examples  of  Canonical-object 

I  choose  first  Cand.  OK?  yes. 

The  2  reasons  are: 

(Interestingness  of  NUMBER  has  changed  recently) 

(Some  new  ,  unchecked  examples  of  NUMBER  have  recently  been 
added) 

Beginning  19th  cycle. 

Checked  examples  of  NUMBER  and  all  entries  were  confirmed 
This  Cand  used  1.909  epu  seconds. 


The  top  3  Cands  are: 

1:  Check  all  examples  of  Canonical-object 

2:  Check  all  things  which  just  barely  miss  being  examples  of  Number 

3:  Restrict  the  following;  Bag-struc-join  Number  Domain 

I  choose  first  Cand.  OK?  yes. 

The  reason  is:  (Some  new  ,  unchecked  examples  of 
CANONICAL-OBJECT  have  recently  been  added) 


Beginning  20th  cycle. 


r-\-  ■vr'rz-',-!'^:  •■■.•  Tr^TT *.T* 
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CANONICAL-OBJECT  has  7  examples  which  occupy  1 1  list  cells,  but  is  not 
interesting  enough  to  warrant  taking  up  that  much  space;  so  about  2 
will  be  selected  at  random  and  forgotten. 

Checked  examples  of  CANONICAL-OBJECT. 

12  entries  were  there  initially. 

10  were  never  confirmed  or  rejected. 

2  had  to  be  completely  discarded. 

5  entries  are  present  now. 

This  Cand  used  16.  626  epu  seconds. 

No  Cand  on  CANOS  is  good  enuf. 

Oo-thresh  reduced  from  495  to  340 

Must  find  new  candidates  and  merge  them  into  CANOS. 


The  top  3  Cands  are: 

1:  Fill  in  some  examples  of  Size 

2:  Fill  in  some  examples  of  Coalesce 

3:  Restrict  the  following:  Bag-struc-join  Number  Oomain 

I  choose  first  Cand.  OK?  yes. 

The  reason  is:  (We  have  no  examples  for  SIZE  yet) 


Beginning  21st  cycle. 


Record  of  attempts  to  find  examples: 

An  ex  (  sought)  is:  (BAG  T  T)++++++++++++++++++++++++++ 

Found  26  examples  (  and  0  non-exs),  in  .996  secs. 

A  nice  ratio  of  exs/non-exs  was  encountered  for  SIZE 
Won’t  try  to  create  a  restricted  interesting  version  of  SIZE. 

Filled  in  examples  of  SIZE. 

13  examples  existed  originally  on  SIZE. 

26  potential  new  entries  were  just  proposed. 

Eliminating  duplicates,  the  newly  constructed  examples  are: 

((BAG  T  T)  -» (BAG  T  T)) 

((BAG  T  T  T  T  T)  -*  (BAG  T  T  T  T  T)) 

((BAG  B)  -♦  (BAG  T)) 

((BAG  A  A)  -♦  (BAG  T  T)) 

((BAG  T  T  T)  -♦  (BAG  T  T  T)) 

((BAG  T  T  T  T)  -♦  (BAG  T  T  T  T)) 

((BAG  A  B)  -♦  (BAG  T  T)) 

((BAG  R2-1  R2-2  R2-3  R2-4  R2-5  R2-6  R3-1  R3-2  R3-3  R3-4  R3-5 
R3-6  R4-1  R4-2  R4-3  R4-4  R4-5  R4-6  R5-1  R5-2  R5-3  R5-4  R5-5  R5-6  R6-1 
R6-2  R6-3  R6-4  R6-5  R6-6)  -» (BAG  TTTTTTTTTTTTTTTTTT 
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TTTTTTTTTTTT)) 

((BAG  A  A  B)  (BAG  T  T  D) 

((BAG  0  D  F  I  M)  -♦  (BAG  T  T  T  T  T)) 

((BAG  A)  (BAG  T)) 

((BAG  TTTTTTTTTTTTTTTTTTTTTTTTTTT 
T  T  T)  -♦  (BAG  TTTTTTTTTTTTTTTTTTTTTTTTTTT 
TTT)) 

((BAG  DON  ED)  -» (BAG  T  T)) 

((BAG  A  B  (BAG  B)  -» (CLASS))  (BAG  T  T  T  T)) 

(;BAG  A  B)  (BAG  T  T)) 

Aft^r  •liminating  duplicate  and  already-known  entries,  AM  finds  that, 
only  14  new,  distinct  examples  of  SIZE  had  to  be  added. 


Do-thresh  raised  from  340  to  414. 
This  Cand  used  9.2  cpu  seconds. 
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